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ENDONUCLEASE DIGESTION-LIGASE DETECTION REACTION 

PROCESS 

5 

This application claims the benefit of U.S. Provisional Patent 
Application Serial No. 60/125,251 filed March 19, 1999. 

This invention was developed with government funding under 
National Institutes of Health Grant Nos. GM-41 337-06, GM-43552-05, GM- 
10 42722-07, and GM-5 1628-02. The U.S. Government may have certain rights. 



FIELD OF THE INVENTION 

The present invention is directed to a process involving coupled 
polymerase chain reaction ("PCR"), restriction endonuclease digestion ("RE"), 
1 5 and ligase detection reaction ("LDR"). 

BACKGROUND OF THE INVENTION 

Cancer Detection 

As the second leading cause of death in this country, almost 
600,000 people will die from cancer per year making cancer one of the most 
alarming of all medical diagnosis. Lifetime risks for developing invasive cancers 

20 in men and women are 50 percent and 33 percent, respectively. Expectations are 
that more than 1 .2 million new cases of cancer will be diagnosed in the United 
States in 1995. Healthcare expenses for cancer in 1994 were approximately $104 
billion. However, the full impact of cancer on families and society is not 
measured only by the amount of money spent on its diagnosis and treatment. A 

25 significant number of people are stricken with cancer in their most productive 

years. Cancers accounted for 18 percent of premature deaths in 1985 and in 1991 
more than 9,200 women in the U.S. died from breast cancer before the age of 55. 

Currently, diagnosis of cancer is based on histological evaluation 
of tumor tissue by a pathologist. After a cancer is diagnosed, treatment is 

30 determined primarily by the extent or stage of the tumor. Tumor stage is defined 
by clinical, radiological, and laboratory methods. Standardized classification 
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sy stems for the staging of tumors have been developed to clearly convey clinical 
information about cancer patients. Staging provides important prognostic 
information and forms the basis of clinical studies which allow the testing of new 
treatment strategies. A staging system was developed (TNM staging system), 

5 which classifies tumors according to the size of the primary tumor, the number of 
regional lymph nodes in which cancer is found, and the presence or absence of 
metastases to other parts of the body. Smaller cancers with no affected lymph 
nodes and no distant metastases are considered early stage cancers, which are 
often amenable to cure through surgical resection. A common measure of 

10 prognosis is the 5-year survival rate, the proportion of patients alive five years 
after the diagnosis of a cancer at a given stage. While 5-year survival rates for 
many cancers have improved over the last few decades, the fact that some early 
stage cancers recur within five years or later has led researchers to explore other 
additional prognostic markers including histological grade, cytometry results, 

1 5 hormone receptor status, and many other tumor markers. Most recently, 
investigators have explored the use of molecular alterations in cancers as 

prognostic indicators. 

Genetic alterations found in cancers, such as point mutations and 
small deletions can act as markers of malignant cells. 

20 

Detection of Minority Nucleic Acid Sequences 

A number of procedures have been disclosed to detect cancer using 
PCR. Sidransky, et aL, "Identification of ras Oncogene Mutations in the Stool of 
Patients with Curable Colorectal Tumors," Science 256: 102-05 (1992) detects 

25 colon cancer by identification of K-ras mutations. This involves a PCR 

amplification of total DNA, cloning into a phage vector, plating out the phage, 
repeated probing with individual oligonucleotides specific to several different K- 
ras mutations, and counting the percentage of positive plaques on a given plate. 
This is a technically difficult procedure which takes three days to complete, 

30 whereby the ratio of mutant to wild-type DNA in the stool sample is determined. 
Brennan, et al., "Molecular Assessment of Histopathological Staging in 
Squamous-Cell Carcinoma of the Head and Neck," N. Engl. J. Med. 332(7): 429- 
35 (1995), finds p53 mutations by sequencing. This specific mutation is then 



probed for in margin tissue using PCR amplification of total DNA, cloning into a 
phage vector, plating out the phage, probing with an individual oligonucleotide 
specific to the mutation found by sequencing, and counting the percentage of 
positive plaques on a given plate. Berthelemy, et al., "Brief Communications- 
5 Identification of K-ras Mutations in Pancreatic Juice in the Early Diagnosis of 
Pancreatic Cancer," Ann. Int. Med. 123(3): 188-91 (1995) uses a PCR/restriction 
enzyme process to detect K-ras mutations in pancreatic secretions. This technique 
is deficient, however, in that mutations are not quantified. Similarly, Tada, et al., 
"Detection of ras Gene Mutations in Pancreatic Juice and Peripheral Blood of 
10 Patients with Pancreatic Adenocarcinoma," Cancer Res. 53: 2472-74 (1993) and 
Tada, et al., "Clinical Application of ras Gene Mutation for Diagnosis of 
Pancreatic Adenocarcinoma," Gastroent. 100: 233-38 (1991) subject such samples 
to allele-specific PCR to detect pancreatic cancer. This has the disadvantages of 
providing false positives due to polymerase extension off normal template, 
1 5 requiring electrophoretic separation of products to distinguish from primer dimers, 
being unable to multiplex closely-clustered sites due to interference of 
overlapping primers, being unable to detect single base or small insertions and 
deletions in small repeat sequences, and not being practically suitable for 
quantification of mutant DNA in a high background of normal DNA. Hayashi, et 
20 al., "Genetic Detection Identifies Occult Lymph Node Metastases Undetectable by 
the Histopathological Method," Cancer Res. 54: 3853-56 (1994) uses an allele- 
specific PCR technique to find K-ras or p53 mutations to identify occult lymph 
node metastases in colon cancers. A sensitivity of one tumor cell in one thousand 
of normal cells is claimed; however, obtaining quantitative values requires 
25 laborious cloning, plating, and probing procedures. In Mitsudomi, et al., 

"Mutations of ras Genes Distinguish a Subset of Non-small-cell Lung Cancer Cell 
Lines from Small-cell Lung Cancer Cell Lines," Oncogene 6: 1353-62 (1991), 
human lung cancer cell lines are screened for point mutations of the K-, H-, and 
N-ras genes using restriction fragment length polymorphisms created through 
30 mismatched primers during PCR amplification of genomic DNA. The 

disadvantages of such primer-mediated RFLP include the requirement of 
electrophoretic separation to distinguish mutant from normal DNA, limited 
applicability to sites that may be converted into a restriction site, the requirement 
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for additional analysis to determine the nature of the mutation, and the difficulty 
in quantifying mutant DNA in a high background of normal DNA. Further, these 
procedures tend to be laborious and inaccurate. 

Coupled PCR/ligation processes have been used for detection of 
5 minority nucleotide sequences in the presence of majority nucleotide sequences. 
A PCR/LDR process is used in Frenkel, "Specific, Sensitive, and Rapid Assay for 
Human Immunodeficiency Virus Type 1 pol Mutations Associated with 
Resistance to Zidovudine and Didanosine," J. Clin. Microbiol 33(2): 342-47 
(1995) to detect HIV mutants. This assay, however, cannot be used for multiplex 

10 detection. See also Abravaya, et al., "Detection of Point Mutations With a 

Modified Ligase Chain (Gap-LCR)," Nucl. Acids Res. 23(4): 675-82 (1995) and 
Balles, et al., "Facilitated Isolation of Rare Recombinants by Ligase Chain 
Reaction: Selection for Intragenic Crossover Events in the Drosophila optomotor- 
WzWGene," Molec. Gen. Genet. 245: 734-40 (1994). 

1 5 Colorectal lesions have been detected by a process involving PCR 

amplification followed by an oligonucleotide ligation assay. See Jen, et al., 
"Molecular Determinants of Dysplasia in Colorectal Lesions," Cancer Res. 54: 
5523-26 (1994) and Redston, et al., "Common Occurrence of APC and K-ras 
Gene Mutations in the Spectrum of Colitis-Associated Neoplasias," Gastroenter. 

20 108: 383-92 (1995). This process was developed as an advance over Powell, et 
al., "Molecular Diagnosis of Familial Adenomatous Polyposis," N. Engl. J. Med. 
329(27): 1982-87 (1993). These techniques tend to be limited and difficult to 
carry out. 

Other procedures have been developed to detect minority 
25 nucleotide sequences. Lu, et al., "Quantitative Aspects of the Mutant Analysis by 
PCR and Restriction Enzyme Cleavage (MAPREC)" PCR Methods and Appl. 3: 
176-80 (1993) detects virus revertants by PCR and restriction enzyme cleavage. 
The disadvantages of MAPREC include the requirement for electrophoretic 
separation to distinguish mutant from normal DNA, limited applicability to sites 
30 that may be converted into a restriction site, the requirement for additional 

analysis to determine the nature of the mutation, and difficulty in quantifying 
mutant DNA in a high background of normal DNA. In Kuppuswamy, et al., 
"Single Nucleotide Primer Extension to Detect Genetic Diseases: Experimental 
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Application to Hemophilia G (Factor IX) and Cystic Fibrosis Genes," Proc. Natl. 
Acad. Sci. USA 88: 1 143-47 (1991), a PCR process is carried out using two 
reaction mixtures for each fragment to be amplified with one mixture containing a 
primer and a labeled nucleotide corresponding to the normal coding sequence, 
5 while the other mixture contains a primer and a labeled nucleotide corresponding 
to the mutant sequence. The disadvantages of such mini sequencing (i.e. SNuPe) 
are that the mutations must be known, it is not possible to multiplex closely 
clustered sites due to interference of overlapping primers, it is not possible to 
detect single base or small insertions and deletions in small repeat sequences, and 

10 four separate reactions are required. A mutagenically separated PCR process is 
disclosed in Rust, et al., "Mutagenically Separated PCR (MS-PCR): a Highly 
Specific One Step Procedure for easy Mutation Detection" Nucl. Acids Res. 
21(16): 3623-29 (1993) to distinguish normal and mutant alleles, using different 
length allele-specific primers. The disadvantages of MS-PCR include possibly 

1 5 providing false positives due to polymerase extension off normal template, 

requiring electrophoretic separation of products to distinguish from primer dimers, 
the inability to multiplex closely-clustered sites due to interference of overlapping 
primers, the inability to detect single base or small insertions and deletions in 
small repeat sequences, and not being ideally suited for quantification of mutant 

20 DNA in high background of normal DNA. In Suzuki, et al., "Detection of ras 
Gene Mutations in Human Lung Cancers by Single-Strand Conformation 
Polymorphism Analysis of Polymerase Chain Reaction Products," Oncogene 5: 
1037-43 (1990), mutations are detected in a process having a PCR phase followed 
by a phase involving single strand conformation polymorphism ("SSCP") of the 

25 amplified DNA fragments. The disadvantages of SSCP include the requirement 
for electrophoretic separation to distinguish mutant conformer from normal 
conformer, the failure to detect 30% of possible mutations, the requirement for 
additional analysis to determine the nature of the mutation, and the inability to 
distinguish mutant from silent polymorphisms. 

30 

Nucleotide Conversion Fidelity 

Many of the approaches to detecting the presence of a given sequence or 
sequences in a polynucleotide sample involve amplification of the minority 
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sequence(s) by polymerase chain reaction (PCR). U.S. Patent No. 4,683,202 to 
Mullis, et al., and R.K. Saiki, et al., Science 230:1350 (1985). In this method, 
primers complementary to opposite end portions of the selected sequence(s) are 
used to promote, in conjunction with thermal cycling, successive rounds of 
5 primer-initiated replication. The amplified sequence(s) may be readily identified 
by a variety of techniques. This approach is particularly useful for detecting the 
presence of low-copy sequences in a polynucleotide-containing sample, e.g., for 
detecting pathogen sequences in a body-fluid sample. However, a nonselective 
PCR strategy will amplify both mutant and wild-type alleles with approximately 

1 0 equal efficiency, resulting in low abundance mutant alleles comprising only a 
small fraction of the final product. If the mutant sequence comprises less than 
25% of the amplified product, it is unlikely that DNA sequencing will be able to 
detect the presence of such an allele. Although it is possible to accurately 
quantify low abundance mutations by first separating the PCR products by cloning 

15 and subsequently probing the clones with allele-specific oligonucleotides (ASOs) 
(Saiki et al., "Analysis of Enzymatically Amplified Beta-Globin and HLA-DQ 
Alpha DNA with Allele-Specific Oligonucleotide Probes," Nature , 
324(6093): 163-6 (1986); Sidransky et al., "Identification of Ras Oncogene 
Mutations in the Stool of Patients with Curable Colorectal Tumors," Science , 

20 256:102-5 (1992); and Brennan et al., "Molecular Assessment of 

Histopathological Staging in Squamous-Cell Carcinoma of the Head And Neck," 
N. Engl. J. Med. . 332(7):429-35 (1995)), this approach is time consuming. In 
contrast, allele-specific PCR methods can rapidly and preferentially amplify 
mutant alleles. For example, multiple mismatch primers have been used to detect 

25 H-ras mutations at a sensitivity of 1 mutant in 10 5 wild-type alleles (Cha et al., 

"Mismatch Amplification Mutation Assay (MAMA): Application to the C-H-Ras 
Gene," PCR Methods Appl. , 2(l):14-20 (1992)) and claims as high as 1 mutant in 
10 6 wild-type alleles have been reported (Haliassos et al., "Detection of Minority 
Point Mutations by Modified PCR Technique: A New Approach for a Sensitive 

30 Diagnosis of Tumor-Progression Markers," Nucleic Acids Res. , 17:8093-9 

(1989); and Chen et al., "A Nonradioactive, Allele-Specific Polymerase Chain 
Reaction for Reproducible Detection of Rare Mutations in Large Amounts of 
Genomic DNA: Application to Human K-Ras," Anal. Biochem. , 244:191-4 
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(1997)). However, careful evaluation suggests these successes are limited to 
allele-specific primers discriminating through 3' purine-purine mismatches. For 
the more common transition mutations, the discriminating mismatch on the 3' 
primer end {i.e., G:T or C:A mismatch) will be removed in a small fraction of 
5 products by polymerase error during extension from the opposite primer on wild- 
type DNA. Thereafter, these error products are efficiently amplified and generate 
false-positive signal. One strategy to eliminate this polymerase error problem is 
to deplete wild-type DNA early in PCR. 

Several investigators have explored selective removal of wild-type 

10 DNA by restriction endonuclease (RE) digestion in order to enrich for low 
abundance mutant sequences. These RFLP methods detect approximately 1 
mutant in 10 6 wild-type or better by combining the sensitivity of polymerase with 
the specificity of restriction endonucleases. One approach has used digestion of 
genomic DNA followed by PCR amplification of the uncut fragments (RFLP- 

1 5 PCR) to detect very low-level mutations within restriction sites in the H-ras and 
p53 genes (Sandy et al., "Genotypic Analysis of Mutations in Taq I Restriction 
Recognition Sites by Restriction Fragment Length Polymorphism/Polymerase 
Chain Reaction," Proc. Natl. Acad. Sci. USA , 89:890-4 (1992); and Pourzand et 
al., "Genotypic Mutation Analysis by RFLP/PCR," Mutat. Res. . 88(1):1 13-21 

20 (1 993)). Similar results have been obtained by digestion following PCR and 

subsequent amplification of the uncleaved DNA now enriched for mutant alleles 
(PCR-RFLP) (Kumar et al., "Oncogene Detection at the Single Cell Level," 
Oncogene , 3(6):647-51 (1988); Kumar et al., "Designed Diagnostic Restriction 
Fragment Length Polymorphisms for The Detection of Point Mutations in Ras 

25 Oncogenes " Oncogene Res. , 4(3):235-41 (1989); and Jacobson et al., "A Highly 
Sensitive Assay for Mutant Ras Genes and its Application to the Study of 
Presentation and Relapse Genotypes in Acute Leukemia," Oncogene , 9(2):553-63 
(1994)). Although sensitive and rapid, RFLP detection methods are limited by the 
requirement that the location of the mutations must coincide with restriction 

30 endonuclease recognition sequences. To circumvent this limitation, primers that 
introduce a new restriction site have been employed in "primer-mediated" RFLP 
(Jacobson et al., "Rapid, Nonradioactive Screening for Activating Ras Oncogene 
Mutations Using PCR-Primer Introduced Restriction Analysis (PCR-PIRA)," PCR 
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Methods Appl. . 1(4):299 (1992); Chen et al., "A Method to Detect Ras Point 
Mutations in Small Subpopulations of Cells," Anal. Biochem. . 195(l):51-6 
(1991); Di Giuseppe et al., "Detection of K-Ras Mutations in Mucinous 
Pancreatic Duct Hyperplasia from a Patient with a Family History of Pancreatic 
5 Carcinoma," Am. J. Pathol. . 144(5):889-95 (1994); Kahn et al., "Rapid and 

Sensitive Nonradioactive Detection of Mutant K-Ras Genes Via 'Enriched' PCR 
Amplification," Oncogene , 6:1079-83 (1991); Levi et al., "Multiple K-Ras Codon 
12 Mutations in Cholangiocarcinomas Demonstrated with a Sensitive Polymerase 
Chain Reaction Technique," Cancer Research . 51(July):3497-502 (1991); and 
10 Mitsudomi et al., "Mutations of Ras Genes Distinguish a Subset of Non-Small- 
Cell Lung Cancer Cell Lines from Small-Cell Lung Cancer Cell Lines," 
Oncogene . 6(8):1353-62 (1991)). However, subsequent investigators have 
demonstrated that errors are produced at the very next base by polymerase 
extension from primers having 3' natural base mismatches (Hattori et al., 
1 5 "Mismatch PCR RFLP Detection of DRD2 Ser3 1 1 Cys Polymorphism and 

Schizophrenia," Biochem. Biophvs. Res. Commun. . 202(2):757-63 (1994); O'Dell 
et al., "PCR Induction of a Taql Restriction Site at Any CpG Dinucleotide Using 
Two Mismatched Primers (CpG-PCR)," Genome Res. . 6(6):558-68 (1996); and 
Hodanova et al., "Incorrect Assignment of N370S Mutation Status by Mismatched 
20 PCR/RFLP Method in Two Gaucher Patients," J. Inherit. Metab. Pis. . 20(4):61 1-2 
(1997)). Such templates fail to cleave during restriction digestion and amplify as 
false-positives that are indistinguishable from true positive products extended 
from mutant templates. 

Use of nucleotide analogs may reduce errors resulting from 
25 polymerase extension and improve base conversion fidelity. Nucleotide analogs 
that are designed to base-pair with more than one of the four natural bases are 
termed "convertides." Base incorporation opposite different convertides has been 
tested (Hoops et al., "Template Directed Incorporation of Nucleotide Mixtures 
Using Azole-Nucleobase Analogs," Nucleic Acids Res. . 25(24):4866-71 (1997)). 
30 For each analog, PCR products were generated using Taq polymerase and primers 
containing an internal nucleotide analog. The products generated showed a 
characteristic distribution of the four bases incorporated opposite the analogs. Of 
significance, these products retained the original sequence at all natural base 
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positions. Convertides readily form degenerate amplification products by virtue 
of their ability to assume different hydrogen bonding patterns through tautomeric 
shift (Brown et aL, "Synthesis and Duplex Stability of Oligonucleotides 
Containing Adenine-Guanine Analogues," Carbohydrate Research , 216:129-39 
5 (1991)), bond rotation (Bergstrom et aL, Nucleosides and Nucleotides . 

15(l-3):59-68 (1996)), or base stacking (Bergstrom et aL, Journal of the American 
Chemical Society . 1 17:1201-9 (1995); and Zhang et aL, "Exploratory Studies on 
Azole Carboxamides as Nucleobase Analogs: Thermal Denaturation Studies on 
Oligodeoxyribonucleotide Duplexes Containing Pyrrole-3-Carboxamide," Nucleic 

10 Acids Res. . 26:2208-15 (1998)). Thus, PCR primers containing convertides may 
be used to facilitate base conversion. In principle, using the 6H,8H-3,4- 
dihydropyrimido[4,5-c][l,2]oxazine-7-one analog Q 6 ) ? which is known to exhibit 
both the C-like and T-like tautomeric forms at the 3 ' end of the primer (Brown et 
aL, "Synthesis and Duplex Stability of Oligonucleotides Containing Adenine- 

15 Guanine Analogues," Carbohydrate Research , 216:129-39 (1991)), a C-G base- 
pair may be converted to a T-A base pair (Figure 1). Due to the better geometry, 
DNA polymerases may "read," or extend better, from a Q6-G pair than a T-G 
mismatch (wobble base pair). Similarly, DNA polymerases may "write," or 
incorporate both G and A bases opposite Q 6 (Hill et aL, "Polymerase Recognition 

20 of Synthetic Oligodeoxyribonucleotides Incorporating Degenerate Pyrimidine and 
Purine Bases " Proc. Natl. Acad. Sci. USA . 95(8):4258-63 (1998)), whereas A is 
always inserted opposite a T base. Thus, the analog primer serves as an 
intermediary, providing a "preconversion" step before a natural base primer is 
added to selectively amplify the desired product from the degenerate pool. While 

25 nucleotide analogs have great potential, they have not been tested in high 

sensitivity assays. There is a need for a method that optimizes the fidelity of the 
analog conversion in the PCR process. 

Optimization of PCR/RE/LDR 

As discussed above, PCR used with a high fidelity conversion 
30 process would provide a valuable method for the amplification of mutant gene 
sequences. By designing primers with one or more mismatches, mutant DNA 
template can be efficiently extended, while poor extension is achieved on wild- 
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type DNA template. However, once these primers extend with or without a 
mismatch, the products thereafter are perfect matches for the primer in subsequent 
PCR cycles. Thus, false positive signals are amplified in subsequent cycles. 
Moreover, PCR error can generate a base change in the template, which perfectly 
5 matches the primer. AS-PCR can detect pyrimidine^>purine transversions at 

sensitivities of 1 in 10 5 (Newton et al., "Analysis of Any Point Mutation in DNA. 
The Amplification Refractory Mutation System (ARMS)," Nucleic Acids Res. , 
17(7):2503-16 (1989); and Tada et al., "Detection of Ras Gene Mutations in 
Pancreatic Juice and Peripheral Blood of Patients with Pancreatic 
10 Adenocarcinoma," Cancer Res. , 53(1 1):2472-4 (1993)). Nevertheless, the 
^ majority of cancer-associated mutations are C^T and A<-»G transitions, as, for 

sQ example, are over 80% of p53 point mutations (de Fromentel et al., Genes 

m Chromosomes Cancer , 4(1): 1-1 5 (1992)). A DNA diagnostic method is needed to 

jif accurately quantify this type of low abundance mutation. 

M* 1 5 The ligation detection reaction (LDR) in conjunction with PCR has 

been used to quantify small amounts of PCR extension product. LDR uses two 
adjacent primers and a thermostable ligase to distinguish all four bases potentially 

m. found at any position in a DNA sequence (Barany, F., "Genetic Disease Detection 

™ and DNA Amplification Using Cloned Thermostable Ligase," Proc. Natl. Acad. 

□ 20 Sci. USA . 88:189-93 (1991); Barany, F. 5 "The Ligase Chain Reaction in a PCR 

World," PCR Methods Appl. , 1 :5-16 (1991); Day et al., "Detection of Steroid 21- 
Hydroxylase Alleles Using Gene-Specific PCR and a Multiplexed Ligation 
Detection Reaction," Genomics , 29:152-62 (1995); and Khanna et al., Oncogene , 
18:27-38 (1999)). Thermostable ligase demonstrates the highest fidelity when the 
25 discriminating base is located at the 3' end of the upstream primer (Luo et al., 

"Improving the Fidelity of Thermus Thermophilus DNA Ligase," Nucleic Acids 
Res. , 24(15):3071-8 (1996)). PCR/LDR (PCR of a sequence from genomic DNA 
followed by LDR) can detect mutations with a sensitivity of approximately 1 
mutant allele in 4,000 normal alleles (Khanna et al., Oncogene , 18:27-38 (1999)). 
30 Sensitivity of approximately 1 in 10 6 has been achieved by combining PCR with 
restriction endonuclease digestion of wild-type DNA (Sandy et al., Proc. Natl. 
Acad. Sci. USA . 89:890-4 (1992); and Pourzand et al., "Genotypic Mutation 
Analysis by RFLP/PCR " Mutat. Res. . 288(0:1 13-21 (1993)). Mutations 
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occurring within the restriction site prevent cleavage of the mutant allele, while 
wild-type alleles bearing canonical restriction site sequence are depleted. As a 
result, subsequent PCR cycles preferentially amplify mutant DNA. If a mutation 
site is not within an endonuclease recognition site present in wild-type DNA, a 
5 restriction site must be introduced. This is typically done by PCR using a primer 
or primers with mismatched bases. Mutations cannot be detected in any portion of 
the restriction site spanned by the primers, since those bases are introduced 
directly through the primers. In a random DNA sequence, over 20% of bases are 
contained within a preexisting four-base restriction site and 60% of bases are 
1 0 within a four-base subsequence that can be converted into a restriction site by a 
:aSi single base change. In these small sites, 3' terminal base mismatch primers must 

jO frequently be used. While conceptually straightforward, 3' mismatch extension 

m has proven to be difficult (Newton et al., "Analysis of Any Point Mutation in 

ji: DNA. The Amplification Refractory Mutation System (ARMS)," Nucleic Acids 

M; 15 Res. . 17(7):2503-16 (1989); Kwok et al., "Effects of Primer-Template 

£ Mismatches on the Polymerase Chain Reaction: Human Immunodeficiency Virus 

[3 Type 1 Model Studies," Nucleic Acids Res . 18(4):999-1005 (1990); O'Dell et al., 

M= Genome Res. , 6(6):558-68 (1996); and Day et al., Nucleic Acids Res. , (1999)). 

:~ The introduction of interrupted palindromic restriction sites has been more 

□ 20 successful using internal mismatch primers spanning one half-site through the 

intervening bases up to the other half-site (Kumar et al., "Oncogene Detection at 
the Single Cell Level," Oncogene , 3(6):647-51 (1988); and Anderson et al., 
"Prevalence of RAS Oncogene Mutation in Head and Neck Carcinomas," J. 
Otolaryngol. , 21(5):321-6 (1992)). Several perfectly matched bases stabilize the 
25 3' end of the mismatch primer. However, this approach may be used only if the 
second half-site is present naturally in wild-type DNA. Mutations in the second 
half-site prevent digestion. Only mutations occurring at bases within the 
recognition sequence are detectable by RFLP methods. Mutations occurring at 
bases outside a preexisting restriction site in wild-type DNA may be detected by 
30 introducing a new restriction site containing that base. 

Restriction endonucleases recognizing interrupted palindromes are 
less abundant than endonucleases recognizing contiguous four- and six-base sites. 
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Multiple base changes would often be required to introduce an interrupted 
palindrome restriction site to identify mutations at any base. 

More recently, methods of identifying known target sequences by 
probe ligation methods have been reported. U.S. Patent No. 4,883,750 to N.M. 
5 Whiteley, et al.; D.Y. Wu, et al., Genomics 4:560 (1989), U. Landegren, et aL, 
Science 241:1077 (1988), and E. Winn-Deen, et al., Clin. Chem. 37:1522 (1991). 
In one approach, known as oligonucleotide ligation assay ("OLA"), two probes or 
probe elements, that span a target region of interest, are hybridized to the target 
region. Where the probe elements base pair with adjacent target bases, the 
10 confronting ends of the probe elements can be joined by ligation, e.g., by 

treatment with ligase. The ligated probe element is then assayed, evidencing the 
D presence of the target sequence. 

ij In a modification of this approach, the ligated probe elements act as 

5? a template for a pair of complementary probe elements. With continued cycles of 

^ 15 denaturation, hybridization, and ligation in the presence of pairs of probe 

. elements, the target sequence is amplified linearly, allowing very small amounts 

^ of target sequence to be detected and/or amplified. This approach is referred to as 

ligase detection reaction. When two complementary pairs of probe elements are 
^ utilized, the process is referred to as the ligase chain reaction, which achieves 

*J 20 exponential amplification of target sequences. F. Barany, "Genetic Disease 

Detection and DNA Amplification Using Cloned Thermostable Ligase," Proc. 
Naf 1 Acad. Sci. USA , 88:189-93 (1991) and F. Barany, "The Ligase Chain 
Reaction (LCR) in a PCR World," PCR Methods and Applications . 1 :5-16 (1991); 
Barany, U.S. Patent Application filed March 19, 1999, S/N 60/125,251. 
25 Techniques, such as PCR/LDR, that rely on mutant enrichment require 

optimization of reaction conditions in order to minimize random PCR errors. 
These errors would be indistinguishable from mutations originally present in 
clinical samples. One source of error-minimization may be found in optimization 
of the buffer conditions for PCR. Standard PCR buffers contain Tris, however the 
30 pK a of Tris is strongly dependent on temperature. A PCR reaction containing Tris 
pH 8.3 (measured at 23°C) is approximately pH 7 near 65°C (the extension 
temperature), and drops to approximately pH 6 near 95°C (the template melting 
temperature). PCR error can result from template degradation and polymerase 
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misincorporation. Template degradation occurs during periods of high 
temperature and low pH in each PCR cycle and limits product size in "long" PCR 
(Barnes, "PCR Amplification of up to 35-Kb DNA with High Fidelity and High 
Yield from Lambda Bacteriophage Templates/' Proc. Natl. Acad. Sci. USA . 
91(6):2216-20 (1994); Cheng et al. ? "Effective Amplification of Long Targets 
from Cloned Inserts and Human Genomic DNA," Proc. Natl. Acad. Sci. USA . 
91(12):5695-9 (1994); and Sang et al., "Generation of Site-Directed Mutagenesis 
by Extralong, High-Fidelity Polymerase Chain Reaction," Anal. Biochem. . 
233(l):142-4 (1996)). Raising the buffer pH in long PCR (using Tris 9.1) reduces 
the amount of template cleavage and increases PCR efficiency (Barnes, "PCR 
Amplification of up to 35-Kb DNA with High Fidelity and High Yield from 
Lambda Bacteriophage Templates," Proc. Natl. Acad. Sci. USA . 91(6):2216-20 
(1994)). Although the efficiency of long PCR increases with higher pH, the level 
of mutations within these PCR products may also increase since high pH 
decreases the fidelity of Tag and Pfu polymerases (Eckert et aL, "High Fidelity 
DNA Synthesis by the Thermus Aquaticus DNA Polymerase," Nucleic Acids 
Res,, 18(13):3739-44 (1990); Eckert et aL, "DNA Polymerase Fidelity and the 
Polymerase Chain Reaction," PCR Methods Appl. . l(l):17-24 (1991); and Cline 
et al., "PCR Fidelity of Pfu DNA Polymerase and Other Thermostable DNA 
Polymerases," Nucleic Acids Res. . 24H 8):3546-51 (1996)). Use of alternative 
PCR buffers with lower | ApKa | can improve polymerase fidelity and still reduce 
template damage by maintaining more neutral pH over a wider temperature range 
(Eckert et al., "DNA Polymerase Fidelity and the Polymerase Chain Reaction," 
PCR Methods AppL . l(l):17-24 (1991); and Brail et al., "Improved Polymerase 
Fidelity in PCR-SSCPA," Mutat. Res. . 303(4): 171 -5 (1993)). The addition of 
glycerol or formamide may reduce mutations arising from template damage 
during PCR cycling and may help avoid misextension from mispaired primers 
(Bottema et al., "PCR Amplification of Specific Alleles: Rapid Detection Of 
Known Mutations and Polymorphisms," Mutat. Res. . 288(1):93-102 (1993); and 
Cha et al., "Mismatch Amplification Mutation Assay (MAMA): Application to the 
C-H-Ras Gene," PCR Methods Appl. . 2(l):14-20 (1992)). 

Thus, there is a need to improve buffer reaction conditions 
currently used in PCR, in order to minimize the opportunity for mismatches 
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caused by PCR error. Increased analog conversion fidelity, alone, will not solve 
the need for an improved method of mutant DNA detection. In addition, there is a 
need to optimize PCR reaction conditions to decrease random PCR error, and 
finally, a method is needed that provides sensitive detection for the PCR extension 
5 products. 

SUMMARY OF THE INVENTION 

The present invention relates to a method for identifying one or 
more low abundance sequences differing by one or more single-base changes, 
insertions, or deletions, from a high abundance sequence, in a plurality of target 
nucleotide sequences. This method involves a first polymerase chain reaction 

1 0 phase, a second polymerase chain reaction phase, and a restriction endonuclease 
digest reaction phase, followed by a third polymerase chain reaction phase and a 
ligase detection reaction phase. 

The starting sample of the present invention is a sample potentially 
containing one or more low abundance target nucleotide sequences with at least 

1 5 one sequence difference each from the high abundant target sequences present in 
the sample. 

In the first polymerase chain reaction phase, a primary 
oligonucleotide primer set is provided. The primary oligonucleotide primer set 
has a first oligonucleotide primer containing a target- specific portion, and a 

20 second oligonucleotide primer containing a target-specific portion. The primary 
oligonucleotide primers are suitable for hybridization on complementary strands 
of a corresponding high and low abundance target nucleotide sequences to permit 
formation of a polymerase chain reaction product. However, the primers each 
have a mismatch which interferes with formation of such a polymerase chain 

25 reaction product when hybridized to any other nucleotide sequence present in the 
sample. The primary oligonucleotide primers, the sample, and a polymerase are 
blended to form a primary polymerase chain reaction mixture. 

The primary polymerase chain reaction mixture is subjected to two 
or more polymerase chain reaction cycles involving a denaturation treatment, a 

30 hybridization treatment, and an extension treatment. The denaturing treatment 
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separates hybridized nucleic acid sequences. The hybridization treatment causes 
the target-specific portions of the primary oligonucleotide primers to hybridize to 
the target nucleotide sequences. The extension treatment causes the hybridized 
primary oligonucleotide primers to be extended to form primary extension 
5 products complementary to the target nucleotide sequence to which the primary 
oligonucleotide primer is hybridized. 

Next, there is a second polymerase chain reaction phase. This 
phase involves providing a secondary oligonucleotide primer set having a first 
oligonucleotide primer with a target-specific portion and a 5' upstream secondary 
10 primer-specific portion and a second oligonucleotide primer with a target- specific 
!=a portion and a 5' upstream secondary primer-specific portion. The secondary 

i£3 oligonucleotide primers in a particular set are suitable for hybridization on 

jy complementary strands of the primary extension products to permit formation of a 

!2? secondary polymerase chain reaction product which contains or creates a 

M 15 restriction endonuclease recognition site when amplifying the high abundance 

r target, but does not contain or create a restriction endonuclease recognition site 

H when amplifying the one or more low abundance targets. The primary extension 

M* products, the secondary oligonucleotide primers, and the polymerase are blended 

i« to form a secondary polymerase chain reaction mixture. 

□ 20 The secondary polymerase chain reaction mixture is subjected to 

two or more polymerase chain reaction cycles comprising a denaturation 
treatment, a hybridization treatment, and an extension treatment. The 
denaturation treatment involves separation of hybridized nucleic acid sequences. 
In the hybridization treatment, the secondary oligonucleotide primers hybridize to 
25 the primary extension products. The extension treatment causes the hybridized 
primary extension products to form secondary extension products complementary 
to the primary extension products. The high abundance secondary extension 
products contain a restriction site but the low abundance secondary extension 
products do not. 

30 The next phase involves blending a restriction endonuclease with 

the secondary extension products to form an endonuclease digestion reaction 
mixture. The restriction endonuclease is one that cleaves, with specificity, the 
restriction endonuclease recognition site within or created when amplifying the 
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high abundance target, but not the low abundance target in the secondary 
extension products. The restriction endonuclease digestion selectively destroys 
the high abundance secondary extension products. 

Next, there is a third polymerase chain reaction phase. This 
5 involves providing a tertiary oligonucleotide primer set having a first tertiary 
primer containing the same sequence as the 5* upstream portion of the first 
oligonucleotide primer of the secondary oligonucleotide primer set and a second 
tertiary primer containing the same sequence as the 5' upstream portion of a 
second oligonucleotide primer of the secondary oligonucleotide primer set. The 

1 0 set of tertiary oligonucleotide primers may be used to amplify all of the secondary 
extension products. The secondary extension products are blended with the 
tertiary oligonucleotide primer set, and a polymerase to form a tertiary polymerase 
chain reaction mixture. 

The tertiary polymerase chain reaction mixture is subjected to two 

15 or more polymerase chain reaction cycles comprising a denaturation treatment, a 
hybridization treatment, and an extension treatment. The denaturation treatment 
causes the hybridized nucleic acid sequences to be separated, while the 
hybridization treatment involves hybridization of the tertiary oligonucleotide 
primers to hybridize to the secondary extension products. During the extension 

20 treatment, the hybridized tertiary oligonucleotide primers are extended to form 
tertiary extension products complementary to the uncleaved secondary extension 
products. 

Next, the tertiary extension products are subjected to a ligase 
detection reaction. This involves providing a plurality of oligonucleotide probe 

25 sets, each set having a first oligonucleotide probe, having a tertiary extension 
product-specific portion and a detectable reporter label, and a second 
oligonucleotide probe, having a tertiary extension product-specific portion. The 
oligonucleotide probes in a particular LDR probe set are suitable for ligation when 
hybridized adjacent to one another on a complementary tertiary extension product- 

30 specific portion. However, there is a mismatch which interferes with such ligation 
when hybridized to any other nucleotide sequence present in the sample. 

The tertiary extension products, the plurality of oligonucleotide 
probe sets, and a ligase are blended to form a ligase detection reaction mixture. 
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The ligase detection reaction mixture is subjected to one or more 
ligase detection reaction cycles having a denaturation treatment, and a 
hybridization treatment. The denaturation treatment involves separation of 
hybridized oligonucleotides from the tertiary extension products. In the 
hybridization treatment, the oligonucleotide probe sets hybridize at adjacent 
positions in a base-specific manner to their respective tertiary extension products, 
if present. As a result, adjacent probes ligate to one another to form a ligation 
product sequence containing the detectable reporter label and the tertiary 
extension product-specific portions connected together. The oligonucleotide 
probe sets may hybridize to nucleotide sequences other than their respective 
complementary tertiary extension products but do not ligate together due to a 
presence of one or more mismatches and individually separate during the 
denaturation treatment. Following the ligase detection reaction cycles, the 
reporter labels of the ligation product sequences are detected which indicates the 
presence of one or more low abundance target nucleotide sequences in the sample. 

The present invention also relates to a kit for identifying one or 
more low abundance sequences differing by one or more single-base changes, 
insertions, or deletions, from a high abundance sequence in a plurality of target 
nucleotide sequences. This kit provides a primary oligonucleotide primer set, a 
secondary oligonucleotide primer set, a tertiary oligonucleotide primer set, and a 
plurality of oligonucleotide probe sets. 

The primary oligonucleotide primer set provided in the kit has (a) a 
first oligonucleotide primer containing a target-specific portion, and (b) a second 
oligonucleotide primer containing a target-specific portion. The primary 
oligonucleotide primers are suitable for hybridization on complementary strands 
of a corresponding high and low abundance target nucleotide sequences to permit 
formation of a primary extension product. However, the primers have a 
mismatch which interferes with formation of such a polymerase chain reaction 
product when hybridized to any other nucleotide sequence present in the sample. 

The secondary oligonucleotide primer set provided in the kit has 
(a) a first oligonucleotide primer, containing a target-specific portion and a 5' 
upstream secondary primer-specific portion, and (b) a second oligonucleotide 
primer, containing a target-specific portion and a 5' upstream secondary primer- 



specific portion. The secondary oligonucleotide primers are suitable for 
hybridization on complementary strands of the primary extension products to 
permit formation of a secondary extension product which contains or creates a 
restriction endonuclease recognition site when amplifying the high abundance 
target, but does not contain or create a restriction endonuclease recognition site 
when amplifying the one or more low abundance targets. 

The tertiary oligonucleotide primer set provided in the kit have (a) 
a first tertiary primer containing the same sequence as the 5' upstream portion of 
the first oligonucleotide primer of the secondary oligonucleotide primer set, and 
(b) a second tertiary primer containing the same sequence as the 5' upstream 
portion of a second oligonucleotide primer of the secondary oligonucleotide 
primer set. The set of tertiary oligonucleotide primers may be used to amplify all 
of the secondary extension products. 

The kit also provides a plurality of oligonucleotide probe sets. 
Each set has (a) a first oligonucleotide probe, containing a tertiary extension 
product-specific portion and a detectable reporter label, and (b) a second 
oligonucleotide probe, containing a tertiary extension product-specific portion. 
The oligonucleotide probes in a particular set are suitable for ligation together 
when hybridized adjacent to one another on a complementary tertiary extension 
product-specific portion. However, the probes have a mismatch which interferes 
with such ligation when hybridized to any other nucleotide sequence present in the 
sample. 

There is not currently available a biological technique that is 
sensitive enough to detect and identify one or more low abundance sequences 
differing by one or more single-base changes, insertions, or deletions, from a high 
abundance sequence, in a plurality of target nucleotide sequences, without the 
potential for many errors during the amplification and detection stages. The 
present invention is directed to overcoming these and other deficiencies in the art, 
by providing a method which optimizes the conditions for selective amplification 
of low abundance DNA targets through polymerase chain reaction and restriction 
endonuclease digestion, and a method of nucleotide detection, that combined, 
creates a highly sensitive method for the amplification and detection of low 
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abundance nucleotides sequences such as mutant genes. This procedure is useful 
for clinical and research applications. 

BRIEF DESCRIPTION OF THE DRAWINGS 

5 

Figure 1 is a schematic drawing of the coupled polymerase chain 
reaction-restriction endonuclease digestion-ligase detection reaction process of the 
present invention. 

Figure 2 shows the detection of ligation products resulting from the 
10 process of Figure 1, using gel electrophoresis. 

Figure 3 shows the detection of ligation products resulting from the 
process of Figure 1 using an addressable array. 

Figure 4 shows a conversion facilitated by nucleotide analog 
preconversion. A C:G base pair in a sequence is targeted for conversion to a T:A 
1 5 basepair. Rather than using a 3 ' natural base mismatch primer to attempt direct 
conversion, a nucleotide analog (Qe) primer is used for preconversion. The Qe 
analog reads the G base well and allows polymerase to efficiently extend from the 
3' Q6 primer. During PCR, the reverse primer anneals to the Qe PCR product and 
is extended by polymerase to synthesize the opposite strand. When polymerase 
20 reaches the Q6 analog in the template, polymerase writes A (or G, not shown) 
opposite the analog and continues synthesis of the strand. After a few cycles, a 
pool of products is made with degenerate sequence opposite the analogs. A 
natural base primer is then added to selectively amplify the products having the 
desired base change. 

25 Figure 5 A-I shows the nucleotide analogs used in PCR primers. In 

the final deprotected oligonucleotide, the name of the nucleoside containing the 
base analog shown are as follows: Qi, l-(2'-deoxy-p-D-ribofuranosyl)imidazole- 
4-carboxamide (Figure 5A); Q 2 , l-(2'-deoxy-p-D-ribofuranosyl)-3-nitropyrrole 
(Figure 5B); Q 5 , 2'-deoxyinosine (Figure 5C); Q 6 , 6-(2'-deoxy-p-D- 

30 ribofuranosyl)-6H,8H-3,4-dihydropyrimido[4,5-c][l ,2]oxazine-7-one (Figure 5D); 
Q7, 2-amino-7-(2'-deoxy-P-D-ribofuranosyl)-6-methoxyaminopurine (Figure 5E); 
Qi6 ? 1 -(2 l-deoxy-P-D-ribofuranosyl)-4-iodopyrazole (Figure 5F); Qi 8? l-(2'- 
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deoxy-P-D-ribofuranosyl)pyrrole-3-carboxamide (Figure 5G); Qi 9 , l-(2'-deoxy- 
(3-D-ribofuranosyl)-4-nitropyrazole (Figure 5H). Base analogs (Q) are attached to 
the 1 ' position of deoxyribofuranose. The nucleoside analogs are attached to the 
controlled pore glass (CPG) column via a succinyl linker (R = linker to CPG). 
5 The oligonucleotide is synthesized from the 5' hydroxyl after removal of the 
dimethoxytrityl (DMT) protecting group, placing the analog at the 3 ' end. After 
cleavage from the CPG column and deprotection, the oligonucleotide is extended 
by polymerase from the 3' base analog hydroxyl group (R = H) (Figure 51). 

Figures 6A-C show the primers used in mismatch extension and 3' 

10 nucleotide analog conversion. Complementary (- strand) sequences are shown in 
reverse orientation (3-5'), e.g., reverse strand primers (names ending in "R"). In 
Figure 6 A, one of nine different synthetic 50 bp duplex templates is shown melted 
with primers aligned to complementary sequence. Primer extension was 
performed using 3' natural base and nucleotide analog primers (p53-248X and 

15 p53-248XR). Some extension products were reamplifled using truncated zipcode 
primers p53zip248 and p53zip248R and sequenced using one of the zipcode 
primers (Ztop or Zbot). In Figure 6B, preconversion was performed on nine 
different 50 bp synthetic duplex templates using 3' nucleotide analog primers, 
e.g. , p53-248Q 6 and p53-248Q 6 R. Conversion, with or without preconversion, 

20 was performed using primers containing 3' natural base, e.g. primers p53zip248T 
and p53zip248TR. These conversion products were reamplifled using zipcode 
primers and identified by LDR. In Figure 6C, LDR probe sets were designed to 
identify specific base changes in conversion products. LDR probes anneal in 
competition with each other to conversion products. Perfectly complementary 

25 upstream and downstream LDR probes with no overlap or gap ligate with high 
specificity. Discriminating probes had different length 5' tails to allow specific 
product separation on an acrylamide gel. A set of probes used to identify PCR 
error products in nonconversion of wild-type template is shown. 

Figures 7A-B show results of a conversion by natural base and Q 6 

30 convertide. Conversion products from nine templates were detected by 

PCR/LDR. Each template was a 50-base pair synthetic duplex DNA of identical 
sequence except for the central four bases which have the sequence indicated. 
Conversion occurred within these four bases. The expected conversion products 
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produced by starting with the conversion primers having the indicated 3 ' natural 
base or convertide are shown. Figure 7A shows the conversion of the first base to 
C with and without Q$ preconversion. Figure 7B shows the conversion of the first 
base to T with and without Q6 preconversion. 
5 Figures 8A-B shows conversion by natural base, Q 5 , and Q 7 

convertides. Conversion products from nine templates were detected by 
PCR/LDR. Each template was a 50-base pair synthetic duplex DNA of identical 
sequence except for the central four bases which have the sequence indicated. 
Conversion occurred within these four bases. The expected conversion products 

1 0 produced by starting with the conversion primers having the indicated 3 ' natural 
base or convertide are shown. Figure 8A shows conversion of the first base to G 
with and without Q 5 or Q 7 preconversion. Figure 8B shows conversion of the first 
base to A with and without Q5 or Q 7 preconversion. 

Figures 9A-H show the fidelity of polymerase extension. Primer 

1 5 slippage accounts for many of the observed products of extension (Figure 7 and 
Figure 8). In Figure 9A, perfectly complementary primer gives correct product. 
For Figure 9B, a T:G mismatch at the second base explains TGGA (or TGCA) 
product. In Figure 9C, extension from a Qe'-G pairing with no slippage on the 
minus strand of the CCGG template (followed by 3 ' T conversion primers) 

20 resulted in the expected TCGA product. Figure 9D shows extension from a Q 6 :G 
pairing with no slippage on the minus strand of the CTGG template and several 
other templates (followed by 3' T conversion primers) resulted in the expected 
products. In Figure 9E, GG mismatch extension apparently gave the expected GC 
product on one template, but perhaps only fortuitously (see Figure 9F). For 

25 Figure 9F, all extensions from GG mismatches gave GC extension products, 
consistent with a GT mismatch formed by slippage at the preceding base. In 
Figure 9G, Qs 'G and Q 7 :G extension products apparently gave the expected GC 
product on one template, but perhaps only fortuitously (see Figure 9H). In Figure 
9H, all extensions from Q 5 :G and Q 7 :G mismatches (followed by 3' G conversion 

30 primers) gave GC extension products consistent with a Q 5 :T or Q 7 :T mismatch at 
the preceding base. 

Figure 10A-C is a schematic of the amplification of p53 exon 7 
from genomic DNA (Figure 1 OA), and conversion of the normal Mspl site to a 
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Taql site. First, the template is subjected to PCR amplification of the p53 Exon 7 
(Figure 10B). Next, a PCR conversion step is carried out, resulting in the 
conversion of codon 248 to a the to create a (TCGA) Taql restriction site (Figure 
10B). 

Figures 1 1 A-C show the PCR primers used to determine 
polymerase fidelity by PCR/RE/LDR at a preexisting restriction site to avoid 
conversion error. The method used is as described in Figure 6 for conversion 
steps. However, in the fidelity experiments shown in Figure 1 1, the buffer 
components were modified, to test for optimal buffer conditions within the 
conversion process. Figure 1 1 A shows a PCR fidelity assay. A synthetic 50-base 
pair duplex marker template (MK) and wild-type p53 exon 7 PCR product are 
mixed at known ratios in parallel reactions. Perfect match primers p53-248short 
and p53-248shortR amplify the wild-type CCGG and marker CGGG. Then, 
longer zipcode containing primers p53zip248short and p53zip248shortR were 
added. Finally, wild-type was repeatedly digested and reamplified with zipcode 
primers (Ztop and Zbot). In Figure 1 IB, preconversion was performed using 
primers containing 3' convertide; e.g., p53-248Q 6 . Conversion of the Mspl site to 
a Taql site with or without preconversion was performed using 3 ' natural base 
primers p53zip248T and p53zip248TR. Long primers were added as above and 
conversion products further amplified. Wild-type products were digested with the 
restriction endonuclease appropriate for the new site. Mutant products were 
preferentially amplified with zipcode primers. In Figure 1 1C; LDR probe sets 
were designed to query the template sequence around the point of ligation. 
Perfectly hybridized upstream and downstream LDR probes with no overlap or 
gap are preferentially ligated with high specificity. Discriminating probes have 
different length 5' tails to allow specific product separation on an acrylamide gel. 
Probes shown were used for identification of mutations occurring in the second 
base of the Mspl site (no conversion). An extra probe (p53LDR248FTCL) was 
used to compare C^T transitions at the first base and second base of the Mspl 
site. A comparable set of discriminating and common probes were used to 
identify mutations at the second base of the Taql site in conversion products had a 
T at the 3' penultimate base in the discrimination primers and A at the 5' 
penultimate base in the common probe. 



-23 - 



Figures 12A-B show the buffer and enzyme dependent PCR errors 
detected by the PCR/RE/LDR experiments shown in Figure 1 1 . The indicated 
polymerase/buffer combinations were used to amplify p53 exon 7 from genomic 
DNA. The same buffers were used in reactions with perfect match primers to 
5 reamplify the Mspl site. In Figure 12A, Taq polymerase (T) was used in various 
test PCR buffers, while in Figure 12B, Vent polymerase (V) was used in various 
test buffers. Vent polymerase did not amplify p53 exon 7 from genomic DNA in 
TsK ? buffer A, buffer. In this case only, two different enzyme/buffer sets were 
used for preamplification and "conversion" (not actual conversion, since perfect 
10 match primers were used). The Ampli7a# T/TsK exon 7 genomic DNA PCR 
;saa product was substituted in the Vent V/TsK, buffer A, reamplification. "C" 

fi indicates no MK was added (control reaction). 

5j Figure 13 shows the results of a comparison of conversion fidelity. 

* » IF* 

\ J The relative intensities of conversion reaction products is indicated by color and 

M= 15 the height of each peak in a 3-D plot. Marker (MK) DNA (with CGGG replacing 

the Mspl site) was added at known ratios to wild-type (WT) in parallel reactions. 
The -log(MK:WT) indicates relative fraction of MK present, e.g., 
jl -log(MK: WT)=3 means the ratio of MK to WT was 1 : 1 000. "C" indicates no MK 

.2: was added (control reaction). Noncon version control reactions (C:G) were 

□ 20 performed using perfect match 3' C primers. Conversion of the Mspl site 

(CCGG) to a Taql site (TCGA) was performed using natural base 3' T primers 
with and without preconversion using V Q 6 nucleotide analog primers (Qe:G and 
T:G reactions, respectively). LDR products from Mspl nonconversion contain 
CNGG, and products from Taql conversion contain TNGA, but only the central 
25 bases (second and third bases) are indicated as _NG_. The LDR products were 
designed to separate on acrylamide gels by two base differences in size. Some 
undetermined bands of intermediate size were also observed. Lanes 1 -4 were 
digested with Mspl, while lanes 5-12 were digested with Taql during 
PCR/RE/LDR. 
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DETAILED DESCRIPTION OF THE INVENTION 

The present invention relates to a method for identifying one or 
more low abundance sequences differing by one or more single-base changes, 
insertions, or deletions, from a high abundance sequence, in a plurality of target 
nucleotide sequences. This method involves a first polymerase chain reaction 
phase, a second polymerase chain reaction phase, and a restriction endonuclease 
digestion reaction phase, followed by a third polymerase chain reaction phase and 
a ligase detection reaction phase. 

The starting sample of the present invention is a sample potentially 
containing one or more low abundance target nucleotide sequences with at least 
one sequence difference each from the high abundance target sequences present in 
the sample. 

In the first polymerase chain reaction phase, a primary 
oligonucleotide primer set is provided. The primary oligonucleotide primer set 
has a first oligonucleotide primer containing a target-specific portion, and a 
second oligonucleotide primer containing a target-specific portion. The primary 
oligonucleotide primers are suitable for hybridization on complementary strands 
of a corresponding high and low abundance target nucleotide sequences to permit 
formation of a polymerase chain reaction product. However, the primers each 
have a mismatch which interferes with formation of such a polymerase chain 
reaction product when hybridized to any other nucleotide sequence present in the 
sample. The primary oligonucleotide primers, the sample, and a polymerase are 
blended to form a primary polymerase chain reaction mixture. 

The primary polymerase chain reaction mixture is subjected to two 
or more polymerase chain reaction cycles involving a denaturation treatment, a 
hybridization treatment, and an extension treatment. The denaturation treatment 
separates hybridized nucleic acid sequences. The hybridization treatment causes 
the target-specific portions of the primary oligonucleotide primers to hybridize to 
the target nucleotide sequences. The extension treatment causes the hybridized 
primary oligonucleotide primers to be extended to form primary extension 
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products complementary to the target nucleotide sequence to which the primary 
oligonucleotide primer is hybridized. 

The efficiency and accuracy of converting the high abundance 
primary polymerase chain reaction product into a secondary polymerase chain 
reaction product containing a restriction endonuclease site may be improved by 
performing a step (referred to a "preconversion") prior to providing the secondary 
oligonucleotide primer set. This step consists of providing a pre-secondary 
oligonucleotide primer set having (a) a first oligonucleotide primer, with a target- 
specific portion, and (b) a second oligonucleotide primer, with a target-specific 
portion. The target-specific portions are identical or substantially identical to the 
secondary oligonucleotide primer set but at least one primer contains one or more 
nucleotide analogs. The oligonucleotides in a particular pre-secondary primer set 
are suitable for hybridization on complementary strands of the primary extension 
products. The primary extension product is denatured, and the primary extension 
products are blended with a polymerase and the pre-secondary oligonucleotide 
primers to form a pre-secondary polymerase chain reaction mixture. The mixture 
is subjected to two or more PCR cycles to permit the formation of a pre-secondary 
polymerase chain reaction product. This product contains one or more nucleotide 
analogs and opposite strand base changes, which facilitates the conversion of the 
primary polymerase chain reaction product sequence into a restriction 
endonuclease recognition site in the subsequent secondary polymerase chain 
reaction. 

Next, there is a second polymerase chain reaction phase. This 
phase involves providing a secondary oligonucleotide primer set having a first 
oligonucleotide primer with a target-specific portion and a 5' upstream secondary 
primer-specific portion and a second oligonucleotide primer with a target-specific 
portion and a 5' upstream secondary primer-specific portion. The secondary 
oligonucleotide primers in a particular set are suitable for hybridization on 
complementary strands of the primary extension products to permit formation of a 
secondary polymerase chain reaction product which contains or creates a 
restriction endonuclease recognition site when amplifying the high abundance 
target, but does not contain or create a restriction endonuclease recognition site 
when amplifying the one or more low abundance targets. The primary extension 
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products, the secondary oligonucleotide primers, and the polymerase are blended 
to form a secondary polymerase chain reaction mixture. 

The secondary polymerase chain reaction mixture is subjected to 
two or more polymerase chain reaction cycles comprising a denaturation 
5 treatment, a hybridization treatment, and an extension treatment. The 

denaturation treatment involves separation of hybridized nucleic acid sequences. 
In the hybridization treatment, the secondary oligonucleotide primers hybridize to 
the primary extension products. The extension treatment causes the hybridized 
primary extension products to form secondary extension products complementary 
10 to the primary extension products. The high abundance secondary extension 
products contain a restriction site but the low abundance secondary extension 
products do not. 

The next phase involves blending a restriction endonuclease with 
the secondary extension products to form an endonuclease digestion reaction 

15 mixture. The restriction endonuclease is one that recognizes and cleaves the 

restriction endonuclease recognition site within or created when amplifying the 
high abundance target, but not the low abundance target in the secondary 
extension products. The restriction endonuclease digestion selectively destroys 
the high abundance secondary extension products. 

20 Next, there is a third polymerase chain reaction phase. This 

involves providing a tertiary oligonucleotide primer set having a first tertiary 
primer containing the same sequence as the 5' upstream portion of the first 
oligonucleotide primer of the secondary oligonucleotide primer set and a second 
tertiary primer containing the same sequence as the 5' upstream portion of a 

25 second oligonucleotide primer of the secondary oligonucleotide primer set. The 
set of tertiary oligonucleotide primers may be used to amplify all of the secondary 
extension products. The secondary extension products are blended with the 
tertiary oligonucleotide primer set, and a polymerase to form a tertiary polymerase 
chain reaction mixture. 

30 The tertiary polymerase chain reaction mixture is subjected to two 

or more polymerase chain reaction cycles comprising a denaturation treatment, a 
hybridization treatment, and an extension treatment. The denaturation treatment 
causes the hybridized nucleic acid sequences to be separated, while the 
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hybridization treatment involves hybridization of the tertiary oligonucleotide 
primers to hybridize to the secondary extension products. During the extension 
treatment, the hybridized tertiary oligonucleotide primers are extended to form 
tertiary extension products complementary to the secondary extension products. 
5 Next, the tertiary extension products are subjected to a ligase 

detection reaction. This involves providing a plurality of oligonucleotide probe 
sets, each set having a first oligonucleotide probe, having a tertiary extension 
product-specific portion and a detectable reporter label, and a second 
oligonucleotide probe, having a tertiary extension product-specific portion. The 
1 0 oligonucleotide probes in a particular set are suitable for ligation together when 

hybridized adjacent to one another on a complementary tertiary extension product- 
specific portion. However, there is a mismatch which interferes with such ligation 
when hybridized to any other nucleotide sequence present in the sample. 

The tertiary extension products, the plurality of oligonucleotide 
1 5 probe sets, and a ligase are blended to form a ligase detection reaction mixture. 

The ligase detection reaction mixture is subjected to one or more 
ligase detection reaction cycles having a denaturation treatment, and a 
hybridization treatment. The denaturation treatment involves separation of 
hybridized oligonucleotides from the tertiary extension products. In the 
20 hybridization treatment, the oligonucleotide probe sets hybridize at adjacent 

positions in a base-specific manner to their respective tertiary extension products, 
if present. As a result, adjacent probes ligate to one another to form a ligation 
product sequence containing the detectable reporter label and the tertiary 
extension product-specific portions connected together. The oligonucleotide 
25 probe sets may hybridize to nucleotide sequences other than their respective 
complementary tertiary extension products but do not ligate together due to a 
presence of one or more mismatches and individually separate during the 
denaturation treatment. Following the ligase detection reaction cycles, the 
reporter labels of the ligation product sequences are detected which indicates the 
30 presence of one or more low abundance target nucleotide sequences in the sample. 

Figure 1 is a schematic diagram of the amplification and detection 
of mutant DNA by PCR/RE/LDR according to the method of the present 
invention. The process begins with a sample containing a normal, or wild-type 
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sequence in high abundance, and, possibly, a mutant of that sequence, such as a 
cancer gene, in low abundance. Comparing the mutant and wild type sequences in 
Figure 1, it is apparent that they differ with respect to only one base. In the wild- 
type sequence, the distinguishing base is a "C," while the mutant sequence has a 
"T" at the corresponding location. 

First, the sample is subjected to a primary PCR step, which creates 
a modified target sequence. In the primary PCR procedure shown in Steps 1 and 2 
of Figure 1 , wild-type and mutant DNA undergo denaturation at 94°C, to create 
single-stranded DNA templates. As shown in Step 1 of Figure 1, primers A and 
B, each containing 3' terminal nucleotide analogue are annealed to the single 
stranded DNA templates. Facilitated by a polymerase, the analogue primers 
undergo an extension procedure as shown in Step 2 of Figure 1. This process of 
denaturing double stranded nucleic acids, hybridizing primers to the resulting 
single stranded nucleic acids, and extending the primers is repeated, in accordance 
with conventional PCR procedures, to produce, in quantity, extension products in 
the form of modified target sequences containing the nucleotide analogue. The 
polymerase chain reaction process is fully described in H. Erlich, et al., "Recent 
Advances in the Polymerase Chain Reaction," Science 252: 1643-50 (1991); M. 
Innis, et aL, PCR Protoc ols: A Guide to Methods and App lications. Academic 
Press: New York (1990); and R. Saiki, et al., "Primer-directed Enzymatic 
Amplification of DNA with a Thermostable DNA Polymerase," Science 239: 487- 
91 (1988), which are hereby incorporated by reference. Further details for use of 
PCR in accordance with the present invention are provided in the Examples 
below. 

As shown in Step 1 of Figure 1 , the wild-type and mutant 
sequences have complementary X:W and Y:Z base pairs proximate to the 
distinguishing bases of these sequences. When these sequences are denatured in 
the primary PCR procedure, primers A and B are configured to hybridize to those 
sequences such that Q bonds to the Y and W bases. In subsequent cycles of the 
primary PCR step, polymerase encounters a Q analog in what is now a template 
strand. The polymerase can "read" the analog as one of several bases and it will 
"write" opposite the Q one of several different bases which in effect are 
"complementary" to the analog. Because different bases can be incorporated at 
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such positions, the products are degenerate, ideally having G, A, T and C present 
in the same positions opposite Q analogs in the pool of products. These 
degenerate bases are indicated by "N." As shown in Step 2 of Figure 1, the 
mismatch, formed in both the wild-type and mutant sequence, is signified as base 
"N". 

Nucleotide analogs suitable for the present invention include, but 
are not limited to, the following: Ql, l-(2'-deoxy-6-D-ribofuranosyl)imidazole-4- 
carboxamide; Q2, l-(2'-deoxy-fl-D-ribofuranosyl)-3-nitropyrrole; Q5, 2*- 

deoxyinosine; Q6 ? 6-(2'-deoxy-B-D-ribofuranosyl)-6H,8H-3,4- 
dihydropyrimido[4,5-c][l,2]oxazine-7-one; Q7, 2-amino-7-(2'-deoxy-B-D- 
ribofuranosyl)-6-methoxyaminopurine; Q16, 1 -(2 '-deoxy-B-D-ribofuranosyl)-4- 
iodopyrazole; Q18, l-(2'-deoxy-fl-D-ribofuranosyl)pyrrole-3-carboxamide; Q19, 
1 -(2 '-deoxy-B-D-ribofuranosyl)-4-nitropyrazole. 

Next, as shown in Step 3 of Figure 1, a secondary PCR phase is 
carried out. The oligonucleotide primer set of the secondary PCR phase has a first 
oligonucleotide primer with a target-specific portion and a 5' upstream secondary 
primer-specific portion, and a second oligonucleotide primer with a target-specific 
portion and a 5' upstream secondary primer-specific portion. These are 
represented by primers "C" and "D" in Step 3 of Figure 1 . During the secondary 
PCR phase, the target specific portions of the secondary PCR primers will 
hybridize to complementary strands of the primary extension products. As shown 
in Step 3 of Figure 1, these hybridized primers are then extended using 
polymerase to form secondary PCR extension products, as shown in Step 3 of 
Figure 1 . The secondary PCR phase is repeated for 2-20 PCR cycles, involving 
denaturation of double stranded nucleic acids, primer hybridization, and primer 
extension, until the primary PCR extension product is sufficiently amplified. 

Due to the "T" v. "C" difference in the mutant sequence versus the 
wild-type sequence, respectively, as shown in Figure 1 , the secondary extension 
products can be treated with the appropriate restriction endonuclease (RE) that 
recognizes the RE site in the secondary extension products derived from the wild- 
type sequence. However, the secondary PCR extension product formed from the 
mutant nucleic acid sequence does not contain a restriction endonuclease site. In 
the embodiment of Figure 1, the restriction endonuclease recognition site 
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incorporated in the secondary extension products derived from the wild-type 
sequence is a Taql recognition site, 5' -TCGA — 3\ Taql cleaves specifically 
within this recognition site, between the T and the C, in each strand of the double 
stranded DNA secondary extension product. However, other restriction 
5 endonuclease sites and their corresponding restriction endonucleases could instead 
be utilized. 

Once the restriction site is incorporated into the high abundance 
target, the appropriate restriction endonuclease is added, under conditions that 
allow the digestion of the restriction site nucleotide sequence. Restriction 

10 endonucleases, derived from bacteria, are enzymes that cut DNA within a 
nucleotide chain. Each restriction endonuclease recognizes specific short 
oligonucleotides from four to eight residues long in a DNA sequence. Under 
appropriate conditions the RE cleaves the each strand at a phosphodiester bond 
within the recognition site. A restriction endonuclease cuts a pure DNA sample 

15 into consistently reproducible fragments that can easily be separated by gel 

electrophoresis. Several hundred restriction endonucleases have been identified 
and are commercially available. See Darnell et al., "Manipulating 
Macromolecules," Molecular Cell Biology . Second Edition, New York, New 
York: W. H. Freeman and Company, pp. 189-225 (1990), which is hereby 

20 incorporated by reference. Any RE can be used in accordance with the method of 
the present invention. The choice of RE to be used is made based upon the 
information available for the DNA in the starting sample. The RE sites that occur 
within a gene are readily available through resources such as GenBank; the 
manufacturer, if the DNA has been obtained commercially, or will have been 

25 identified as part of the study of the gene, prior to the use of the method of the 
present invention. Primer design for the PCR phases of the invention is based 
upon the knowledge of the restriction sites in a gene. RE site identification can be 
readily determined by DNA mapping and DNA cloning. Watson et al., "In Vitro 
Mutagenesis," Recombinant DNA , Second Edition, New York, New York: W. H. 

30 Freeman and Company, pp. 191-194 (1983), which is hereby incorporated by 
reference. Optimal conditions for complete cleavage by a restriction 
endonuclease are specific. Digestion should be carried out per manufacturer's 
recommendations for a particular RE, or as described in Sambrook et al., 
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Molecular Cloning: A Laboratory Manual , Second Edition, Cold Spring Harbor 
Press, NY (1989), which is hereby incorporated by reference. 

After treatment of the secondary extension product with a 
restriction endonuclease, a tertiary PCR process is carried out. As shown in Step 
5 6 of Figure 1, this involves providing a tertiary oligonucleotide primer set having 
a first tertiary primer that is a universal primer (Ul), containing the same sequence 
as the 5' upstream portion of the first oligonucleotide primer of the secondary 
oligonucleotide primer set, and a second tertiary primer containing the same 
sequence as the 5' upstream portion of a second oligonucleotide primer of the 
1 0 secondary oligonucleotide primer set. The set of tertiary oligonucleotide primers 
n are used in amplifying the restriction endonuclease-digested secondary extension 

;4f products. The secondary extension products are blended with the tertiary 

Sy oligonucleotide primer set, and a polymerase to form a tertiary polymerase chain 

■~ reaction mixture. 

^ 1 5 The tertiary polymerase chain reaction mixture is subjected to 

polymerase chain reaction conditions involving denaturation of double stranded 

•7! nucleic acids, hybridization of tertiary oligonucleotide primers to the resulting 

single stranded nucleic acids, and extension of the hybridized primer. This 

13 procedure is repeated through a sufficient number of cycles involving these steps 

^ 20 in order to produce an appropriate amount of tertiary PCR extension products. 

Since the secondary PCR extension products have been treated with a restriction 
endonuclease, the tertiary PCR extension products derived from the wild-type 
nucleic acid sequence are short as a result of their being cleaved proximate to the 
nucleotide corresponding to the distinguishing base. On the other hand, the 
25 tertiary PCR extension products derived from the mutant nucleic acid sequence 
are larger and not cleaved proximate to the nucleotide corresponding to the 
distinguishing base. See Figure 1, Step 6. As a result, the tertiary extension 
products derived from the mutant nucleic acid sequence are readily distinguished 
from those derived from wild-type nucleic acid sequence in the subsequent LDR 
30 step. 

A second restriction endonuclease digest may also be carried out 
following the tertiary PCR phase, to remove any high abundance targets that may 
have been amplified due to PCR error or incomplete digestion in the first RE step. 
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Following the tertiary PCR extension procedure, the resulting 
tertiary extension products are subjected to an LDR procedure according to either 
Step 7 A or Step 7B of Figure 1. In either of these alternatives, the LDR procedure 
begins by first denaturing the tertiary PCR extension products to produce single- 
5 stranded DNA (ssDNA). The discrimination probes are distinguished by having a 
different nucleotide on the 3 ' ends and a different number of adenine bases on 
their 5 5 ends. As a result, ligation products produced from each of the 
discrimination probes are identified by detection of the fluorescent label and 
distinguished from one another on a gel by their differing electrophoretic mobility. 

1 0 In Step 7A of Figure 1 , the LDR probe set contains 4 alternative 

discrimination probes and a common probe, which can ligate to any of the four 
discrimination probes. The common probe contains a fluorescent tag on its 3' 
end. The tertiary extension products, these oligonucleotide probes, and a 
thermostable ligase are combined to form a ligase detection reaction mixture and 

15 subjected to a series of ligase detection reaction cycles. 

The ligase detection reaction is described generally in WO 
90/17239 to Barany et al., "Cloning, Overexpression and Nucleotide Sequence of 
a Thermostable DNA Ligase-encoding Gene," Gene . 109:1-1 1 (1991), and F. 
Barany, "Genetic Disease Detection and DNA Amplification Using Cloned 

20 Thermostable Ligase," Proc. Natl. Acad. Sci. USA . 88:189-193 (1991), and U.S. 
Patent Nos. 5,494,810, 5,830,71 1 and 6,027,889 to Barany, the disclosures of 
which are hereby incorporated by reference. In accordance with the present 
invention, the ligase detection reaction can use 2 sets of complementary 
oligonucleotides. This is known as the ligase chain reaction, which is described in 

25 the immediately preceding references, which are hereby incorporated by 

reference. Alternatively, the ligase detection reaction can involve a single cycle 
that is known as the oligonucleotide ligation assay. See Landegren, et al., "A 
Ligase-Mediated Gene Detection Technique," Science 241:1077-80 (1988); 
Landegren, et al., "DNA Diagnostics — Molecular Techniques and Automation," 

30 Science 242:229-37 (1988); and U.S. Patent No. 4,988,617 to Landegren, et al., 
which are hereby incorporated by reference. 

During ligase detection reaction phases, the denaturation treatment 
is carried out at a temperature of 80-105° C, while hybridization takes place at 50- 
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85° C. Each cycle comprises a denaturation treatment and a thermal hybridization 
treatment which in total is from about one to five minutes long. Typically, the 
ligation detection reaction involves repeatedly denaturing and hybridizing for 2 to 
50 cycles. The total time for the ligase detection reaction phase is 1 to 250 
5 minutes. 

The preferred thermostable ligase is that derived from Thermus 
aquaticus. This enzyme can be isolated from that organism. M. Takahashi, et al., 
"Thermophilic DNA Ligase," J. Biol. Chem. 259:10041-47 (1984), which is 
hereby incorporated by reference. Alternatively, it can be prepared 
10 recombinantly. Procedures for such isolation as well as the recombinant 
^ production of Thermus aquaticus ligase (as well as Thermus themophilus ligase) 

D are disclosed in WO 90/17239 to Barany, et al., F. Barany, et al., "Cloning, 

TJ Overexpression and Nucleotide Sequence of a Thermostable DNA-Ligase 

ij Encoding Gene," Gene 109:1-1 1 (1991), and U.S. Patent Nos. 5,494,810 and 

^ 1 5 5,830,71 1 to Barany, which are hereby incorporated by reference. These 

references contain complete sequence information for this ligase as well as the 

1 ~3 

~ encoding DNA. Other suitable ligases include E. coli ligase, T4 ligase, and 

^ Pyrococcus ligase. 

=j The ligation detection reaction mixture may include a carrier DNA, 

^ 20 such as salmon sperm DNA. 

The hybridization step in the ligase detection reaction, which is 
preferably a thermal hybridization treatment, discriminates between nucleotide 
sequences based on a distinguishing nucleotide at the ligation junctions. The 
difference between the target nucleotide sequences can be, for example, a single 
25 nucleic acid base difference, a nucleic acid deletion, a nucleic acid insertion, or 
rearrangement. Such sequence differences involving more than one base can also 
be detected. 

The oligonucleotide probe sets, as noted above, may have a 
reporter label suitable for detection. Useful labels include chromophores, 
30 fluorescent moieties, enzymes, antigens, heavy metals, magnetic probes, dyes, 
phosphorescent groups, radioactive materials, chemiluminescent moieties, and 
electrochemical detecting moieties. Also appropriate would be the use of 
molecular weight tags for discrimination by mass spectroscopy, such as a matrix 
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assisted laser desorption ionization - time of flight mass spectroscopy (MALDI- 
TOF) array system. 

The oligonucleotide probe sets can be in the form of 
ribonucleotides, deoxynucleotides, modified ribonucleotides, modified 
deoxyribonucleotides, peptide nucleotide analogues, modified peptide nucleotide 
analogues, modified phosphate-sugar-backbone oligonucleotides, nucleotide 
analogs, and mixtures thereof. 

The oligonucleotides of the oligonucleotide probe sets each have a 

hybridization or melting temperature (i.e. T m ) of 66-70°C. These oligonucleotides 

are 20-28 nucleotides long. 

As shown in Step 7 A of Figure 1, the ligation products derived 
from the mutant nucleic acid sequence are formed from the discrimination probe 
having 2 adenine bases on its 5' end. The other discrimination probes will not be 
ligated to the common probe as a result of the mutant nucleic acid sequence being 
present. 

On the other hand, for the most part, no ligation products using the 
above-described probe sets, will be formed from the wild-type sequence. This 
occurs because the endonuclease digestion causes tertiary extension products 
derived from the wild-type nucleic acid sequence to be cleaved, and, as a result, 
both the common and discrimination probes of that probe set will not hybridize to 
such tertiary extension product in a manner permitting them to be ligated together. 
However, to the extent the endonuclease digestion of the secondary extension 
products derived from the wild-type sequence is incomplete, a small residual 
amount of uncleaved tertiary extension product derived from the wild-type 
sequence may be present in the ligase detection reaction mixture. This will cause 
ligation products to be produced from the discrimination probe having no 
adenines at its 5' end. 

Figure 2 shows the gel electrophoresis detection of products 
formed from the ligase detection reaction procedure of Figure 1, Step 7 A. As 
shown in Figure 2, unligated probes have great electrophoretic mobility and form 
a band at the bottom of the gel. Ligation products derived from any small amount 
of tertiary extension products derived from the wild-type sequence have the next 
greatest electrophoretic mobility and form a band at an intermediate location on 
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the gel. Ligation products derived from the mutant sequence have less 
electrophoretic mobility than the wild-type ligation products, and form a thick 
band slightly above that of the wild-type ligation products. If no mutant sequence 
is present in the sample, no such gel band will be formed. The top two bands are 
trace ligation products with low electrophoretic mobility. These high molecular 
weight bands are artifacts, presumably arising from polymerase error and template 
decomposition during PCR. The composition of the PCR buffer can strongly 
affect the fraction of these PCR error products present in the sample after 
amplification. 

The use of capillary and gel electrophoresis to detect DNA 
products based on differences in their electrophoretic mobility is well known. See 
e.g., Grossman, et al., "High-density Multiplex Detection of Nucleic Acid 
Sequences: Oligonucleotide Ligation Assay and Sequence-coded Separation," 
Nucl. Acids Res . 22(21): 4527-34 (1994), which is hereby incorporated by 
reference. 

As an alternative to the ligase detection reaction procedure shown 
in Figure 1 , Step 7B, the discrimination probes can be provided with different 
address-specific portions (i.e. ZIP1, ZIP2, ZIP3, and ZIP4) at their 5' ends. This 
enables the ligation products to be detected on an addressable array having 
different capture oligonucleotide probes which are complementary to the address- 
specific portions on the discrimination probes. As a result, each ligation product 
is directed to a different address of a DNA microarray during a hybridization 
procedure subsequent to the ligase detection reaction phase. The hybridized 
ligation products all have the same label but are distinguished from one another by 
the location on which the ligation products are immobilized. Alternatively, 
different labels can be on the discrimination base, while the address-specific 
portion is on the common probe. In this embodiment, different ligation products 
will be immobilized on the array at locations with the same capture probe; 
however, the different ligation products will be distinguished by different labels. 

Figure 3 shows the detection of ligation products resulting from the 
process of Figure 1, Step 7B using an addressable array. In this case, as shown in 
Figure 1, Step 7B, the common probe has a label and the discrimination probes 
have the address-specific portion. In Figure 3, the ligation products are caused to 
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contact the addressable array under conditions effective to permit address-specific 
portions of the ligation products to hybridize to capture oligonucleotides on the 
array. As shown in Figure 3, Steps 2 and 3, discrimination probes (e.g., probes 
having address-specific portions ZIP4 (not shown) and ZIP3 (shown)) which do 
5 not form ligation products are immobilized on the array but are not detected due to 
the absence of a label. Unligated common probes do not hybridize to the array 
and are subsequently washed away (e.g., at 65°C-80°C and low salt) so they 
produce no signal. Any small amount of ligation products derived from the wild- 
type nucleic acid sequence (due to incomplete upstream endonuclease digestion) 
1 0 are immobilized (together with any of the corresponding unligated discrimination 
^ probe) at the capture oligonucleotide complementary to address-specific portion 

;£j ZIP1 . Similarly, any ligation products derived from the mutant sequence are 

ry. immobilized (together with any of the corresponding unligated probe) at the 

capture oligonucleotide complementary to address-specific portion ZIP2. The 
1 5 presence of mutant sequence and/or wild-type sequence is detected in this 
embodiment by the existence of a fluorescent signal at respective different 
It: locations on the array. Heterozygosity is indicated by equal signals at the capture 

M oligonucleotides complementary to address-specific portions ZIP1 and ZIP2. The 

j=[ signals may be quantified using a fluorescent imager. This format uses a unique 

^ 20 address for each allele and may be preferred for achieving very accurate detection 

of low levels of signal (30 to 100 attomoles of LDR product). 

The use of a solid support with an array of capture oligonucleotides 
is fully disclosed in pending provisional U.S. Patent Application Serial No. 
60/01 1,359, which is hereby incorporated by reference. This method involves 
25 providing a solid support having an array of positions each suitable for attachment 
of an oligonucleotide. A linker or support (preferably non-hydrolyzable), suitable 
for coupling an oligonucleotide to the solid support at each of the array positions, 
is attached to the solid support. An array of oligonucleotides on a solid support is 
formed by a series of cycles of activating selected array positions for attachment 
30 of either single or a multiple nucleotides at the activated array positions. 

When using such arrays, the oligonucleotide probes used in the 
above-described coupled PCR and LDR phases, respectively, have an addressable 
array-specific portion. After the PCR and LDR phases are completed, the 
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addressable array-specific portions for the products of such processes remain 
single stranded and are caused to hybridize to the capture oligonucleotides during 
a capture phase. C. Newton, et al., "The Production of PCR Products With 5' 
Single-Stranded Tails Using Primers That Incorporate Novel Phosphoramidite 
Intermediates," NucL Acids Res. 2 1(5): 1 1 55-62 (1993), which is hereby 
incorporated by reference. 

During the capture phase of the process, the mixture is contacted 
with the solid support at a temperature of 45-90° C and for a time period of up to 
60 minutes. Adding cations, volume exclusion or chaotropic agents may 
accelerate hybridizations. When an array consists of dozens to hundreds of 
addresses, it is important that the correct ligation product sequences have an 
opportunity to hybridize to the appropriate address. This may be achieved by the 
thermal motion of oligonucleotides at the high temperatures used, by mechanical 
movement of the fluid in contact with the array surface, or by moving the 
oligonucleotides across the array by electric fields. After hybridization, the array 
is washed sequentially with a low stringency wash buffer and then a high 
stringency wash buffer. 

It is important to select capture oligonucleotides and addressable 
nucleotide sequences which will hybridize in a stable fashion. This requires that 
the oligonucleotide sets and the capture oligonucleotides be configured so that the 
oligonucleotide sets hybridize to the target nucleotide sequences at a temperature 
less than that which the capture oligonucleotides hybridize to the addressable 
array-specific portions. Unless the oligonucleotides are designed in this fashion, 
false positive signals may result due to capture of adjacent unreacted 
oligonucleotides from the same oligonucleotide set which are hybridized to the 
target. 

The capture oligonucleotides can be in the form of ribonucleotides, 
deoxyribonucleotides, modified ribonucleotides, modified deoxyribonucleotides, 
peptide nucleotide analogues, modified peptide nucleotide analogues, modified 
phosphate-sugar backbone oligonucleotides, nucleotide analogues, and mixtures 
thereof. 

It may be desirable to destroy chemically or enzymatically 
unconverted all unligated LDR oligonucleotide probes that contain addressable 
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nucleotide array-specific portions prior to capture of the ligation products on a 
DNA array. Such unconverted probes will otherwise compete with ligation 
products for binding at the addresses on the array of the solid support which 
contain complementary sequences. Destruction can be accomplished by utilizing 
5 an exonuclease, such as exonuclease III (L-H Guo and R. Wu, Methods in 
Enzymologv 100:60-96 (1985), which is hereby incorporated by reference) in 
combination with LDR probes that are blocked at the ends and not involved with 
ligation of probes to one another. The blocking moiety could be a reporter group 
or a phosphorothioate group. T.T. Nikiforow, et al., "The Use of 
10 Phosphorothioate Primers and Exonuclease Hydrolysis for the Preparation of 
i 3 Single-stranded PCR Products and their Detection by Solid-phase Hybridization," 

;3 PCR Methods and Applications . 3:p.285-291 (1994), which is hereby incorporated 

I y by reference. After the LDR process, unligated probes are selectively destroyed 

by incubation of the reaction mixture with the exonuclease. The ligated probes 
15 are protected due to the elimination of free 3' ends which are required for initiation 
„" of the exonuclease reaction. This approach results in an increase in the signal-to- 

H[ noise ratio, especially where the LDR reaction forms only a small amount of 

M= product. Since unligated oligonucleotides compete for capture by the capture 

g oligonucleotide, such competition with the ligated oligonucleotides lowers the 

u 20 signal. An additional advantage of this approach is that unhybridized label- 

containing sequences are degraded and, therefore, are less able to cause a target- 
independent background signal, because they can be removed more easily from 
the DNA array by washing. 

A wide variety of infectious diseases can be detected by the 
25 method of the present invention. Typically, these are caused by bacterial, viral, 
parasite, and fungal infectious agents. The resistance of various infectious agents 
to drugs can also be determined using the present invention. 

Bacterial infectious agents which can be detected by the present 
invention include Escherichia coli, Salmonella, Shigella, Klebsiella, 
30 Pseudomonas, Listeria monocytogenes, Mycobacterium tuberculosis, 

Mycobacterium avium-intracellular e, Yersinia, Francisella, Pasteurella, Brucella, 
Clostridia, Bordetella pertussis, Bacteroides, Staphylococcus aureus, 
Streptococcus pneumonia, B-Hemolytic strep., Corynebacteria, Legionella, 
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Mycoplasma, Ureaplasma, Chlamydia, Neisseria gonorrhea, Neisseria 
meningitides, Hemophilus influenza, Enterococcus faecalis, Proteus vulgaris, 
Proteus mirabilis, Helicobacter pylori, Treponema palladium, Borrelia 
burgdorferi, Borrelia recurrentis, Rickettsial pathogens, Nocardia, and 
5 Actinomycetes. 

Fungal infectious agents, which can be detected by the present 
invention, include Cryptococcus neoformans, Blastomyces dermatitidis, 
Histoplasma capsulatum, Coccidioides immitis, Paracoccidioides brasiliensis, 
Candida albicans, Aspergillus fumigatus, Phycomycetes (Rhizopus), Sporothrix 
10 schenckii, Chromomycosis, and Maduromycosis . 

Viral infectious agents, which can be detected by the present 
invention, include human immunodeficiency virus, human T-cell 
lymphocytotrophic virus, hepatitis viruses (e.g., Hepatitis B virus and Hepatitis C 
virus), Epstein-Barr virus, cytomegalovirus, human papillomaviruses, orthomyxo 
1 5 viruses, paramyxo viruses, adenoviruses, corona viruses, rhabdo viruses, 

polioviruses, toga viruses, bunya viruses, arena viruses, rubella viruses, and reo 
viruses. 

Parasitic agents which can be detected by the present invention 
include Plasmodium falciparum, Plasmodium malaria, Plasmodium vivax, 

20 Plasmodium ovale, Onchoverva volvulus, Leishmania, Trypanosoma spp., 
Schistosoma spp,, Entamoeba histolytica, Cryptosporidium, Giardia spp., 
Trichomonas spp., Balantidium coli, Wuchereria bancrofti, Toxoplasma spp., 
Enterobius vermicularis, Ascaris lumbricoides, Trichuris trichiura, Dracunculus 
medinesis, trematodes, Diphyllobothrium latum, Taenia spp,, Pneumocystis 

25 carinii, and Necator americanus. 

The present invention is also useful for detection of drug resistance 
by infectious agents. For example, vancomycin-resistant Enterococcus faecium, 
methicillin-resistant Staphylococcus aureus, penicillin-resistant Streptococcus 
pneumoniae, multi-drug resistant Mycobacterium tuberculosis, and AZT-resistant 

30 human immunodeficiency virus can all be identified with the present invention. 

Genetic diseases can also be detected by the method of the present 
invention. This can be carried out by prenatal or post-natal screening for 
chromosomal and genetic aberrations or for genetic diseases. Examples of 
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detectable genetic diseases include: 21 hydroxylase deficiency, cystic fibrosis, 
Fragile X Syndrome, Turner Syndrome, Duchenne Muscular Dystrophy, Down 
Syndrome or other trisomies, heart disease, single gene diseases, HLA typing, 
phenylketonuria, sickle cell anemia, Tay-Sachs Disease, thalassemia, Klinefelter 
5 Syndrome, Huntington Disease, autoimmune diseases, lipidosis, obesity defects, 
hemophilia, inborn errors of metabolism, and diabetes. 

Cancers that can be detected by the method of the present invention 
generally involve oncogenes, tumor suppressor genes, or genes involved in DNA 
amplification, replication, recombination, or repair. Examples of these include: 

10 BRCA1 gene, p53 gene, APC gene, Her2/Neu amplification, Bcr/Abl, K-ras 

gene, and human papillomavirus Types 16 and 18. Various aspects of the present 
invention can be used to identify amplifications, large deletions as well as point 
mutations and small deletions/insertions of the above genes in the following 
common human cancers: leukemia, colon cancer, breast cancer, lung cancer, 

15 prostate cancer, brain tumors, central nervous system tumors, bladder tumors, 
melanomas, liver cancer, osteosarcoma and other bone cancers, testicular and 
ovarian carcinomas, head and neck tumors, and cervical neoplasms. 

In the area of environmental monitoring, the present invention can 
be used for detection, identification, and monitoring of pathogenic and indigenous 

20 microorganisms in natural and engineered ecosystems and microcosms such as in 
municipal waste water purification systems and water reservoirs or in polluted 
areas undergoing bioremediation. It is also possible to detect plasmids containing 
genes that can metabolize xenobiotics, to monitor specific target microorganisms 
in population dynamic studies, or either to detect, identify, or monitor genetically 

25 modified microorganisms in the environment and in industrial plants. 

In the food and feed industry, the present invention has a wide 
variety of applications. For example, it can be used for identification and 
characterization of production organisms such as yeast for production of beer, 
wine, cheese, yogurt, bread, etc. Another area of use is with regard to quality 

30 control and certification of products and processes (e.g., livestock, pasteurization, 
and meat processing) for contaminants. Other uses include the characterization of 
plants, bulbs, and seeds for breeding purposes, identification of the presence of 
plant-specific pathogens, and detection and identification of veterinary infections. 
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The present invention is useful in detecting the presence of one or 
more of a low abundance sequence, that differs from a high abundance sequence 
by one or more single-base changes, insertions, deletions, or translocations when 
the low abundance sequence is present in the same sample in less than a 1 : 1 ,000 
5 molar ratio, preferably, in less than a 1 : 1 0,000 molar ratio, and, most preferably, 
in less than a 1 : 100,000 molar ratio, to the amount of the high abundance 
sequence. 

Another aspect of the present invention is its capability to quantify 
the amount of target nucleotide sequence in a sample. This can be achieved by 

10 establishing an internal standard (i.e. where the standard establishing material is 
amplified and detected with the sample). 

For quantification using an internal standard, a known amount of 
one or more marker target nucleotide sequences is added to the sample. In 
addition, a plurality of marker-specific oligonucleotide probe sets are added along 

15 with the ligase, the previously discussed oligonucleotide probe sets, and the 
tertiary extension products to a mixture. The marker-specific oligonucleotide 
probe sets have (1) a first oligonucleotide probe with a target-specific portion 
complementary to the marker target nucleotide sequence, and (2) a second 
oligonucleotide probe with a target-specific portion complementary to the marker 

20 target nucleotide sequence and a detectable reporter label. The oligonucleotide 
probes in a particular marker-specific oligonucleotide set are suitable for ligation 
together when hybridized adjacent to one another on a corresponding marker 
target nucleotide sequence. However, there is a mismatch which interferes with 
such ligation when hybridized to any other nucleotide sequence present in the 

25 sample or added marker sequences. The presence of ligation product sequences is 
identified by detection of reporter labels. The amount of target nucleotide 
sequences in the sample is then determined by comparing the amount of ligation 
product sequence generated from known amounts of marker target nucleotide 
sequences with the amount of other ligation product sequences. 

30 The present invention also relates to a kit for identifying one or 

more low abundance sequences differing by one or more single-base changes, 
insertions, or deletions, from a high abundance sequence in a plurality of target 
nucleotide sequences. This kit provides a primary oligonucleotide primer set, a 
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secondary oligonucleotide primer set, a tertiary oligonucleotide primer set, and a 
plurality of oligonucleotide probe sets. 

The primary oligonucleotide primer set provided in the kit has (a) a 
first oligonucleotide primer containing a target-specific portion, and (b) a second 
5 oligonucleotide primer containing a target-specific portion. The primary 

oligonucleotide primers are suitable for hybridization on complementary strands 
of a corresponding high and low abundance target nucleotide sequences to permit 
formation of a primary extension product. However, the primers have a 
mismatch which interferes with formation of such a polymerase chain reaction 
10 product when hybridized to any other nucleotide sequence present in the sample. 
^ The secondary oligonucleotide primer set provided in the kit has 

;S (a) a first oligonucleotide primer, containing a target- specific portion and a 5* 

fy upstream secondary primer-specific portion, and (b) a second oligonucleotide 

;Sl primer, containing a target-specific portion and a 5' upstream secondary primer- 

M; 15 specific portion. The secondary oligonucleotide primers are suitable for 

:=* hybridization on complementary strands of the primary extension products to 

permit formation of a secondary extension product which contains or creates a 
M= restriction endonuclease recognition site when amplifying the high abundance 

3 

}=l target, but does not contain or create a restriction endonuclease recognition site 

^ 20 when amplifying the one or more low abundance targets. 

The tertiary oligonucleotide primer set provided in the kit have (a) 
a first tertiary primer containing the same sequence as the 5' upstream portion of 
the first oligonucleotide primer of the secondary oligonucleotide primer set, and 
(b) a second tertiary primer containing the same sequence as the 5' upstream 
25 portion of a second oligonucleotide primer of the secondary oligonucleotide 

primer set. The set of tertiary oligonucleotide primers may be used to amplify all 
of the secondary extension products. 

The kit also provides a plurality of oligonucleotide probe sets. 
Each set has (a) a first oligonucleotide probe, containing a tertiary extension 
30 product-specific portion and a detectable reporter label, and (b) a second 

oligonucleotide probe, containing a tertiary extension product-specific portion. 
The oligonucleotide probes in a particular set are suitable for ligation together 
when hybridized adjacent to one another on a complementary tertiary extension 
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product-specific portion. However, the probes have a mismatch which interferes 
with such ligation when hybridized to any other nucleotide sequence present in the 
sample. 

The kit may also contain a polymerase, an endonuclease restriction 
5 enzyme, and/or a ligase, as described above. 

EXAMPLES 

Optimizing Analog Additions under PCR Conditions 

n Example 1 - Oligonucleotide Synthesis for Analog Efficiency 

; as: 

ry 10 Oligonucleotides were synthesized at the 0.2 (imole scale by 

□ cyanoethyl phosphoramidite chemistry on an Applied Biosystems 394 DNA 
\I synthesizer. Standard 5 00 A CPG columns and reagents (Applied Biosystems) 

were used with the following exceptions: oligonucleotides 50 bases in length 
iTi were synthesized using wide-pore 1000A CPG columns (Applied Biosystems); 

^ 15 oligonucleotides with fluorescent dye FAM at the 5' terminus were synthesized 

□ using FAM phosphoramidite (Applied Biosystems) with a 1 5 minute coupling 

~ step; oligonucleotides with 5' phosphate were synthesized using phosphorylation 

reagent (Glen Research), with a 15 minute coupling step; oligonucleotides with 3' 
blocking group were synthesized using Y-Spacer CPG columns (Glen Research). 

20 Oligonucleotides with the 3' nucleotide analogs 2'-deoxyinosine Q5), 6-(2'- 

deoxy-p-D-ribofuranosyl)-6H, 8H-3 ? 4-dihydropyrimido[4,5-cl[ 1 ? 2]oxazine-7-one 
(Qe) 5 and 2-amino-7-(2'-deoxy-p-D-ribofuranosyl)-6-methoxyaminopurine Q7) 
were synthesized using 2'-deoxyinosine-CPG ? dP-CPG and dK-CPG, respectively 
(Glen Research), as shown in Figure 5. The oligonucleotide primers containing 

25 Qi, Q2 and Qi 8 at the Y-position were synthesized from Qi, Q 2 , and Qig derived- 
CPG synthesized from Qi (Johnson et al. ? "The Synthesis and Stability of 
Oligodeoxyribonucleotides Containing the Deoxyadenosine Mimic 1 -(2'-Deoxy- 
Beta-D-Ribofuranosyl)Imidazole-4-Carboxamide," Nucleic Acid Res. , 25:559-67 
(1997)), Q2 (Bergstrom et al., Journal of the American Chemical Society , 
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1 17:1201-9 (1995)) and Q 18 (Zhang et al., "Exploratory Studies on Azole 
Carboxamides as Nucleobase Analogs: Thermal Denaturation Studies on 
Oligodeoxyribonucleotide Duplexes Containing Pyrrole-3-Carboxamide," Nucleic 
Acids Res. . 26:2208-15 (1998)) by the method of Pon et al. (Pon et al., 
5 "Derivatization of Controlled Pore Glass Beads for Solid Phase Oligonucleotide 
Synthesis," BioTechniques . 6:768-75 (1988)). 

Example 2 - PCR Polymerases and Buffers 

10 The DNA polymerases used were Amplira^, AmpliTaq - Stoeffel 

Fragment, AmpliTaq - Fluorescent Sequencing (Applied Biosy stems), Vent and 
Vent(exo') (New England Biolabs), and Expand polymerase mix ( Taq and Pfu 
polymerase mixture, in Expand High Fidelity kit, Boehringer Mannheim). The 
commercially available PCR buffers used were supplied in the AmpliTag and 

1 5 Expand High Fidelity kits. An alternative buffer CiNF, buffer G(f) 5 is described 
elsewhere (Day et aL, Nucleic Acids Res . (1999)). Briefly, CiNF reactions 
contain 20 mM citrate pH 7.6, 200 ng/ml bovine serum albumin, 2.5 mM MgC12, 
200 jjM dNTP (each) and either 16 mM (NH 4 ) 2 S0 4 , or 50 mM potassium acetate, 
10% formamide, primers, and template DNA. All PCR and LDR reactions 

20 described below were performed under paraffin oil. 

Example 3 - Mismatch Extension Efficiency 

Primers containing natural bases and nucleotide analogs were used 
25 in PCR to compare and measure the efficiency of product formation from 

synthetic duplex p53 exon 7 templates having Mspl (CCGG), Taql (TCGA), Hhal 
(GCGC), or Tail (ACGT) sites at the Mspl position containing codon 248. The 
primers hybridized to wild-type sequence on either side of the Mspl site with the 
3 5 ends of the primers extending one base into the site on each side, see 
30 Figure 6 A. Eight different analogs and the four natural bases were tested in 

parallel reactions on each of the four synthetic templates. PCR was performed 
using Stoeffel Taq or Ta^-Fluorescent Sequencing polymerases with the buffer 
supplied by manufacturer for each polymerase. 10 pmol of each primer and 20 
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fmol of duplex template were used, and 0.2 mM each dNTP and 4 mM MgCl 2 . 
Parallel reactions underwent 10, 20, 30, 40, and 50 PCR cycles of 94°C for 15 sec, 
65°C for 1 min. Efficiency and yield were determined from samples run on 6% 
agarose gels and stained with ethidium bromide. 

5 

Example 4 - Mismatch Conversion Product Sequencing 

Products most efficiently amplified by each analog were diluted 
1 000-fold in water. The diluted DNA products were reamplified for 20 cycles of 
10 94°C for 1 5 sec, 65°C for 2 min. using the same polymerase and buffer as in the 
previous PCR, but with the addition of 10 pmol of "zipcode" containing primers 
p53zip248 and p53zip248, shown in Figure 6A. Zipcode sequences are 
oligonucleotides with no known sequence similarity to DNA sequences in any 
organism. Amplification with zipcode primers is intended to specifically amplify 

1 5 the zipcode containing products of the previous PCR; I e. , only converted DNA 
(containing zipcodes) and not the nearly identical unconverted DNA (lacking 
zipcodes) will be amplified. Conversion products were run on 8% agarose gels 
and bands of the expected size excised. DNA was extracted from the gel slices by 
centrifugation in a 235C microcentrifuge (Fisher) for 30 min through 0.45 jam 

20 HVLP filter (Millipore). The conversion product was dried and resuspended in 
ABI Dye Terminator Cycle Sequencing reaction mix with one of the zipcode 
primers according to kit instructions (Applied Biosystems). An equal volume (3 
\x\ each) of sequencing reaction was combined with dye mix consisting of 83% 
formamide (Eastman), 4 mM EDTA and 8 mg/ml Blue Dextran (Sigma). Samples 

25 were electrophoresed on a 7M urea, 10% acrylamide gel (19:1 bis, 0.6x TBE in 
gel and running buffer) in an ABI 373 DNA Sequencer. Data were analyzed 
using ABI 373A DNA Sequencer Data Analysis software version 1.2.0. 

Example 5 - Conversion Product Identification, With and Without A 
30 Preconversion Step 

Conversion fidelity was tested using nine different synthetic 
templates, with and without preconversion using three primers containing Q 5 , Q 6 , 
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and Q 7 (see Example 1). Preconversion PCR was performed with 3' analog 
primers prior to adding the desired natural base primers, in an effort to avoid 
mismatch primer extension. The 50-basepair duplex DNA templates contained 
the wild-type p53 sequence surrounding codon 248, shown in Figure 6B, except 
5 for the bases corresponding to the Msp\ site (CCGG). The following sequences 
were substituted at the Mspl position: 1) CCGG (wild-type), 2) CTGG, 3) CGGG, 
4) CAGG, 5) TCGA, 6) GCGC, 7) ACGT, 8) ACGT, and 9) GCGC. A pre- 
secondary PCR reaction ("preconversion") was performed with hot start using 50 
fmol/|Lil p53-248Q N and p53-248Q N R primers, and Vent (exo-) in CiNF buffer, 
10 buffer G(f), and 10 fmol/(j,l of duplex template. Preconversion used 2 PCR cycles 

□ of 94°C 15 sec, 55°C 1 min, 60°C 1 min. Product was stored at 4°C. Conversion 
\rt reactions were started with 1 |nl of preconversion reaction containing the same 

polymerase and buffer, but no additional template. Each reaction required 1 0 

□ pmol of each primer, using one of the four pairs p53zip248N and p53zip248NR 
: g 15 (N = C, T, G, or A). Parallel conversion reactions with no preconversion were 

initiated with a hot start by adding 1 0 fmol of synthetic duplex template instead of 
W preconversion reaction aliquot. PCR cycles were as follows: 5 cycles of 94°C 1 5 

G sec, 55°C + 1° per cycle 1 min, 60°C 1 min; then 20 cycles of 94°C 15 sec, and 

3 60°C 2 min. A final extension was performed at 60°C for 5 min. Polymerase was 

20 inactivated by freezing and thawing twice. Products were diluted lOx in water 
and reamplified by adding 1 \il to 20 jil of Expand polymerase and buffer mix. 
PCR was performed for 20 cycles (30 cycles for low yield reactions) of 94°C 1 5 
sec, 65°C 2 min using 12 pmol of zipcode primers Ztop and Zbot (Figure 6). LDR 
was performed as described below to identify the conversion products generated. 

25 

Example 6 - Ligase Detection Reaction 

Ligase detection reactions were performed in standard LDR buffer 
(25 mM Tris pH 7.6, 12 mM MgCl 2 , 65 ^ig/ml bovine serum albumin, 100 mM 
30 KC1, and 10 mM DTT). Each 20 |Lil reaction contained approximately 500 fmol of 
dsDNA (1 jal of sample), 500 fmol of each discriminating probe, and 750 fmol of 
common probe, as shown in Figure 6C. Sets of discriminating and common 
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probes were synthesized to perform LDR on the expected conversion products and 
varied at the bases (Bi) corresponding to the Mspl position sense strand (B1B2B3B4 
= CCGG for wild-type). The discrimination probes had wild- type sequence and 
terminate in -BiB2(-OH 3'). The discrimination probes were synthesized as a set 
5 of four probes each with C, T ? G and A in turn at B2. The common LDR probes 
had (5'P04-)B3B4- followed by wild-type sequence, and hybridized to the 
template with its 5' base adjacent to the 3' base of a discrimination probe. 
Discrimination probes varied the 3' terminal base to identify error products at B2 
of the Mspl position. For simplicity, only B2 was monitored. LDR probes 

10 matched the expected conversion products; for example, conversion of -CCGG- 
template to -ACGT- required discrimination probes ending in -AC, -AT, -AG, and 
-AA, and a common probe with 5' pGT-. Discrimination probes had 5' tails of 
different length and a FAM label for fluorescence detection. The tail length 
allowed physical separation of different LDR products on an acrylamide gel, and 

15 thus identification of the LDR products. 

LDR reactions were preincubated for 1 .5 min at 94°C prior to the 
addition of 5 nmol Tth ligase, followed by 10 LDR cycles of 94°C 15 sec, 65°C 2 
min, and a final hold briefly at 94°C. Reactions were cold quenched and stored at 
-70°C. The LDR products were separated on 10% acrylamide gels containing 7M 

20 urea, with 0.6x TBE (lx TBE: 90 mM Tris base, 90 mM Borate, 2 mM EDTA) in 
the gel and running buffer. Data were collected using an ABI 373 DNA 
sequencer with Genescan 672 software. 

Example 7 - Image Processing 

25 

Gel pictures were produced by the ABI 672 Analysis software. 
Dye specific images were opened in Adobe Photoshop 3.0, cropped, resized, and 
converted to grayscale. The grayscale images were opened in NIH Image 1 .59, 
inverted, and ID vertical background was subtracted. The background-subtracted 
30 images were reinverted and rendered in pseudocolor by Photoshop to make 

intensity differences easier to compare. Except for color replacement, only linear 
image processing was performed to preserve relative intensities. 
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Initial experiments were designed to determine the efficiency of 
generating PCR products when using probes containing 3' terminal nucleotide 
analogs (see above Examples). Eight different analogs were designed to pair with 
more than one of the four natural bases in order to convert one base to another 
base at a specific position in a sequence. Primer pairs containing either a 
nucleotide analog or one of the four natural bases at their 3 ' ends were used to 
amplify four different templates (Figure 6A). Each nucleotide analog and natural 
base was mispaired (or paired) in turn with all four natural bases on the opposite 
strand, and amplification was attempted with either Taq Stoeffel Fragment or Taq 
Fluorescent Sequencing polymerases. The relative amplification efficiency was 
determined by the number of cycles required to generate visible product on an 
ethidium bromide stained agarose gel, as shown in Table 1. The 50-base pair 
synthetic duplex DNA templates containing p53 sequence spanning codon 248 are 
distinguished by the four bases replacing the Mspl site, which are shown. 
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Table 1 



TCGA template 



CCGG 
template 



GCGC 
template 



ACGT 
template 



primer 
3' base 



reads A 
write (eff.) 



reads G 
write (eff.) 



reads C 
write (eff.) 



reads T 
write (eff.) 



T 
C 
G 
A 

Qi 

Q 2 
Qs 
Qe 
Q 7 

Ql6 
Ql8 
Ql9 



A,T 



A.G 



A.T 



(+++) 

(++) 
(++) 

(+) 

(±) 

(±y 

(++) 
(+++) 

(+> 

(+) 1 
(++)« 



A (++) 
G (+++) 
(++) 

(+) 
(±)' 

(±)' 

(++) 
(+++) 

(+) 
(-) 
(±)' 
(-) 



(++) 
(++) 

C (+++) 

T (+++) 

(-) 

(-) 

(+++) 

(++) 
(+++) 

(-) 
<±)' 



T 
C 



T,A 



(++) 

(++) 
(+++) 

(+++) 

(+++) 

1 

(++) 1 

(+++) 

(++) 
(+++) 

(") 

(+++) 



i 



Low product yield. 



Table 1. Extension efficiency and conversion with 3' natural base and nucleotide analog primers. 
Four different templates were used to test primer extension from a 3' base or analog paired in turn 
with A, G, C and T. Relative efficiency was determined by the number of cycles required to 
generate visible product with Taq Stoeffel Fragment polymerase: 10 cycles, (++) 20 cycles, (+) 30 

10 cycles, (±) 40 to 50 cycles, (-) no product. Two of the natural base mismatch primer products were 
sequenced. Generally, the most efficiently amplified template for each analog was reamplified 
with truncated primers and sequenced to determine which bases are written opposite each analog. 
In one case (Ch) a lower efficiency extension product with higher yield was selected for 
sequencing. Mixed base writing preference for some analogs is indicated, with most frequent 

1 5 product listed first. 

Both Taq Stoeffel Fragment and Taq Fluorescent Sequencing 
polymerases generated comparable amounts of product. Perfectly matched natural 
base primers generated visible product after 1 0 cycles, however some analog 

20 primers generated no product after 50 cycles. The analogs that did amplify with 
high efficiency were those that were best able to "read" the opposite strand 
sequence, see Figure 4. 

One product for each analog (as well as the natural base controls) 
were reamplified and sequenced to determine polymerase preference in inserting 

25 nucleotide bases opposite the analog, shown in Table 1. The Qi, Q$, Q6, Qi6 ? and 
Qis primers generated detectable true conversion product, however only Q5 



primers generated almost exclusively true conversion product. No single analog 
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functioned as a "universal base" (Hill et al., "Polymerase Recognition of Synthetic 
Oligodeoxyribonucleotides Incorporating Degenerate Pyrimidine and Purine 
Bases," Proc. Natl. Acad. Sci. USA , 95(8):4258-63 (1998)) capable of generalized 
conversion. Unexpectedly, some products contained sequences that were difficult 
to read across the middle four bases, suggesting single base insertions or deletions 
occurred during PCR extension. This was especially prevalent in products 
generated from mismatched natural bases (see below). 

To test the ability of convertides to reduce mismatch extension 
errors, the effects of preconversion PCR cycles on fidelity were assessed. PCR 
products generated from template amplified with only natural base conversion 
primers were compared to products resulting from two initial PCR cycles using 
convertides followed by selective amplification using specific natural base 
primers. Preconversion PCR was performed with primer pairs containing Q 5? Q 6 , 
and Q 7 analogs, since these convertides had been shown to be the most efficiently 
extended. To improve overall PCR fidelity and 3' mismatch primer extension, 
CiNF buffer, buffer G(f), was used (Day et al., Nucleic Acids Res . (1999)). Nine 
different synthetic duplex templates containing mutated Mspl sites were amplified 
with or without preconversion using 3' analog preconversion primers. Both 
natural base conversion primers and 3' analog preconversion primers were 
designed to manipulate the outside bases CCGG of the Mspl position, shown in 
Figure 6A-B. Some conversions were intended to serve as controls. In these 
cases, the original bases in the template were either restored after analog 
preconversion or never changed with full-length perfect match primers. All steps 
were performed identically between preconversion and non-preconversion 
reactions, except that preconversion reactions used as template the product of 2 
cycles of convertide PCR for succeeding rounds of amplification, while synthetic 
duplex served as the starting material for PCR with no preconversion. In both 
cases, 3 ' natural base primers were used to selectively amplify the desired 
endproduct. These primers contained non-hybridizing zipcode sequences on their 
5' ends, which ultimately served as primer binding sites for the final 20-30 cycles 
of PCR, shown in Figure 6B. Conversion products were quantified by LDR, 
Figure 6C. 
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Figure 7 shows the products of the experiment. Overall, natural 
base mismatch conversion generated greater than 80% incorrect conversion 
products, shown in Figure 7A, lane 9, Figure 7B, lane 1, 3, 5, 7, 15 and 17), but 
preconversion could improve the fidelity and/or the yield of some conversions. In 
5 general, transversions were difficult to achieve even with preconversion. G— >C 
and A^C conversion generated very little of the expected product with either the 
natural base or primers (Figure 7 A, lanes 11-14). Use of Q6 preconversion 
improved the yield of G— >T and A— >T conversion products (compare natural base 
conversion in Figure 7B lanes 11,13 with Qe preconversion in lanes 12 and 14). 

10 In the case of transitions, C^T conversion produced unexpected one-base 

shortened artifacts with natural base mismatch primers on the CXGG templates 
(Figure 7B, lanes 1, 3, 5, 7, 15, and 17), but the correct products were generated 
when using Q6 preconversion, as seen in Figure 7B lanes 2, 4, 6, 8, 16, and 18). 
In addition, primers did improve the yield of the expected T^C conversion 

15 product, shown in Figure 7 A lanes 9 and 10. The controls performed as expected: 
all C^C and T— >T nonconversion reactions worked correctly without 
convertides, seen in Figure 7A lanes 1, 3, 5, 7, 15, and 17, and 5B, lane 9, and the 
corresponding Qe preconversion products were restored to the original sequence, 
shown in Figure 7A lanes 2, 4, 6, 8, 16, and 18, and Figure 7B, lane 10). In 

20 summary, Qe preconversion reduced or eliminated artifacts produced by natural 
base C^T and T^C conversion and facilitated transitions in general. 
Transversions were only partially successful: G— >T and A->T conversions could 
be improved with preconversion, but G^C and A— >C conversion could not be 
achieved. 

25 Apparently correct conversions were observed with attempted 

C— >G and C— >A transversions, however, carefully designed control templates 
revealed that these "conversions" were artifactual. C— >G and C— >A conversion 
appeared to be successful for templates containing a central CpG dinucleotide 
(Figure 8 A and 8B, lanes 1-3, 13-21). However, the same final conversion 

30 products were observed with other templates lacking the central CpG 

dinucleotide, now clearly incorrect. For example, a GCGC product resulted 
during G conversion in reactions where the expected product should have 
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contained T, G or A in the second position (Figure 8 A lanes 4-12). Also, an 
ACGT product resulted during a conversion where the expected product should 
have inserted a non-C base in the second position (Figure 8B lanes 4-12, and 22- 
27). The mismatch primers used to alter the outer bases of the recognition site did 
5 not reach the central dinucleotide, yet these bases were altered. It is doubtful the 
"successful" conversions occurred through the intended mechanism, and thus 
represent fortuitous artifact. The yield of LDR product was low for two 
palindromic templates despite efficient PCR (lanes 22-27 in both Figure 8A and 
Figure 8B). These conversion reaction products presumably contain a large 

10 fraction of insertions or deletions, which cannot be detected by the current set of 
LDR probes. In summary, C^G conversion was partially accomplished by both 
Q 5 (Figure 8A, lanes 5, 8, 1 1, and 23) and the natural base G (Figure 8A, lanes 4, 
7, 10, and 22); however, preconversion does not appear to improve conversion. 
C-»G conversion exhibits sequence dependence. 

1 5 The results of the preconversion study indicate that errors in natural 

base conversion were prevalent, but the use of Qs, Q6 and Q 7 convertides in 
preconversion reduced polymerase error in certain cases. In terms of conversion 
reactions, transitions were easier to accomplish than transversions. This is in 
agreement with previous findings: Newton et al observed more errors in 

20 extension of primers with 3' terminal C-T, A- A, and T-T mismatches 

(transversions) than with purine-pyrimidine mismatches (transitions) (Newton et 
aL, "Analysis of Any Point Mutation in DNA. The Amplification Refractory 
Mutation System (ARMS)," Nucleic Acids Res. 17(7):2503-16 (1989)). Here, 
pyrimidine-pyrimidine conversion usually generated the expected product, 

25 especially when using convertides. In cases of purine-pyrimidine, and 

pyrimidine-purine conversion, incorrect products were often generated. A 
summary of results is shown in Table 2, below. Formation of incorrect 
conversion products can be explained in part by a transient base-pair slippage of 
the primer 3' nucleotide (or analog) to a misaligned position on the template, as 

30 shown in Figure 9. As a result, the sequence following the mismatch is not 

complementary to the original template. Consistent with this hypothesis is the 
observation of unreadable sequence immediately following the analog in initial 
sequencing experiments. Palindromic products, especially CpG dinucleotides, are 
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themselves prone to slippage and extension. Palindromic products were 
frequently produced from non-palindromic templates. These artifacts were 
reduced by the presence of 1 0% formamide in the PCR buffer, presumably 
through destabilization of misaligned structures. Finally, nucleotide analogs 
5 produced fewer artifacts than natural bases. Different analogs produced different 
kinds and quantities of artifacts, perhaps according to their relative ability to base 
pair and stabilize a slippage misalignment. Thus, if polymerase extension is slow 
from an analog poorly base-paired with the template, extension from a strong 
transient base pair generated by slippage could exceed the rate of extension from a 
10 weakly base-paired 3 5 terminal base. 
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First base converted to 

Starting 



Template 8 




C 


T 


G 


A 


1 


CCGG 


c 


Q6 


Qt (FP) 


Qs (FP) 


2 


CTGG 


c 


Q6 


X (err C) 


X (err C) 


3 


CGGG 


c 


Q 6 


Q 5 (err C) 


X (err C) 


4 


CAGG 


c 


Q6 


G (err C) 


X (err C) 


5 


TCGA 


Q6 


T or Q 6 


Q 7 (FP) 


Qs (FP) 


6 


GCGC 


X (err G) 


Q 6 


G 


Qs or Q 7 


7 


ACGT 


X (err G) 


Q6 weak 


Qt 


A or Q7 


8 


CATG 


C 


X 


Qs (err C) 


X (err C) 


9 


CGCG 


C 


Qe 


X 


Q 7 (err C) 



The 50 bp synthetic duplex DNA templates containing p53 sequence spanning codon 248 are 
5 distinguished by the four bases replacing the Mspl site, which are shown. 

Table 2: For the most effective conversions, see Figures 5 and 6. Nine duplex DNA templates 
were used in conversion reactions. Each contained sequence identical to p53 surrounding codon 
248, except the Mspl site was replaced by a different four base sequence (B^BsE^). Bj and B 4 - 
(opposite strand) were simultaneously converted in turn to C, T, G and A either directly by PCR 

1 0 using natural base primers, or by preconversion PCR with nucleotide analog primers followed by 
PCR with natural base primers. In nonconversion control reactions the "conversion" product is 
identical to the original template. A natural base is used to indicate control reactions, and cases in 
which preconversion did not improve conversion. Preconversion was performed using Q 6 to 
facilitate conversion to C and T, and using Q 5 and Q 7 to facilitate conversions to G and A. 

1 5 Conversion primers determine Bi and B 4 ; LDR was performed to detect unintended base changes 
in B 2 (which ideally is unchanged after conversion). Conversion improved by preconversion is 
indicated by the nucleotide analog used. Preconversion equally as effective in control reactions as 
natural base primers is also indicated by the analog used. Low conversion fidelity results in large 
B 2 error. Major B 2 error products are identified (e.g. err C indicates C at B 2 ), and the absence of 

20 correct product indicated no conversion method was successful (X = no correct product). 

Apparently correct product probably formed through a fortuitous mechanism is indicated (FP = 
false positive). 

As discussed earlier, PCR-RFLP has been widely used to detect rare mutations. A 
25 limitation of this technique is reliance on serendipity to yield mutations that can 
be modified to create restriction sites in either the wild-type or the mutant 
template. A second limitation imposed on this approach is the need to avoid using 
3' terminal mismatch primers, since extension from these primers is known to be 
error-prone. To date, the majority of successful attempts have used interrupted 
30 palindromic restriction sites to avoid using 3' terminal mismatch primers. 

Mutations in the cancer causing K-ray and H-ras were detected at a sensitivity of 

r 

1 in 1 0 5 using PCR/RFLP with interrupted palindromic enzymes Xmnl (Kumar et 



al. ? "Oncogene Detection at the Single Cell Level," Oncogene , 3(6):647-51 
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(1988)), AlwNl (Anderson et al., "Prevalence of RAS Oncogene Mutation in Head 
and Neck Carcinomas," J. Otolaryngol. , 21(5):321-6 (1992)), and Bsfril or Mval 
(Urban et al., "Detection of C-Ki-Ras Mutation by PCR/RFLP Analysis and 
Diagnosis of Pancreatic Adenocarcinomas," J. Natl. Cancer Inst. , 85(24):2008-12 
5 (1993); and Ronai et al., "Quantitative Enriched PCR (QEPCR), a Highly 

Sensitive Method for Detection of K-Ras Oncogene Mutation," Hum. MutaU 
10(4):322-5 (1997)). These PCR-RFLP experiments and others (Hattori et al., 
"Mismatch PCR RFLP Detection of DRD2 Ser31 ICys Polymorphism and 
Schizophrenia," Biochem. Biophys. Res. Commun. , 202(2):757-63 (1994); 
10 Beutler et al., "The Facile Detection of the Nt 1226 Mutation of 
_ Glucocerebrosidase by 'Mismatched' PCR," Clin. Chim. Acta. . 194(2-3):161-6 

0 (1990); Hingorani et al., "A Simple Molecular Assay for the CI 166 Variant of the 

1J Angiotensin II Type 1 Receptor Gene," Biochem. Biophys. Res. Commun. , 

ij 213(2):725-9 (1995); Kuwata et al., J Allergy Clin Immunol 96(6 Pt 2): 105 1-60 

=Z 15 (1995); Nishiwaki et al., "Mutational Screening of APP Gene in Patients with 

Early-Onset Alzheimer Disease Utilizing Mismatched PCR-RFLP," Clin. Genet. , 
?J 49(3):1 19-23 (1996); and Ishihara et al., "Analysis of Allelic Variation of the 

* TAP2 Gene in Sarcoidosis," Tissue Antigens , 49(2):107-10 (1997)) avoid 3' 

3 terminal mismatches, however most cancer mutations are in sequences that cannot 

^ 20 be converted to interrupted palindromes, for example at CpG dinucleotides. 

A larger fraction of mutations would be made into targets for detection if 
contiguous recognition sequences could be introduced with as few errors as 
interrupted palindromic recognition sequences. Currently, contiguous restriction 
sites are introduced by terminal 3 ' mismatch primer extension, which is prone to 
25 errors. O'Dell et al. tested a general method for introducing different restriction 
sites at CpG dinucleotides using mismatch PCR (O'Dell et al., "PCR Induction of 
a TaqI Restriction Site at Any CpG Dinucleotide Using Two Mismatched Primers 
(CpG-PCR)," Genome Res. . 6(6):558-68 (1996)). The outer bases of four 
different CpG dinucleotides in the human LDL receptor gene were altered to 
30 create Taq\ (TCGA), Mspl (CCGG) or Rhal (GCGC) sites. In these targets, Taql 
sites were successfully generated by 3' T mismatch primers. The method was 
able to detect homozygous and heterozygous individuals; however, the ratio of 
products representing each allele was not equal, as is expected in germline 
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mutations. Several cases have been shown here where T mismatch conversion 
failed to create a Taql site. Thus the method is sequence dependent. O'Dell et al. 
found that C and G mismatch conversion failed. These results agree with 
O'Dell's conclusion that stronger base pairing leads to mispriming, possibly 
5 through stabilization of primer slippage on the template. Gotoda et al., "Detection 
of Three Separate DNA Polymorphisms in the Human Lipoprotein Lipase Gene 
by Gene Amplification and Restriction Endonuclease Digestion," J. Lipid Res. , 
33(7): 1067-72 (1992), claim to have successfully used PCR-RFLP to introduce a 
MaeW site (ACGT) by extension of a 3' C-A mismatch to produce a T— >C 
10 transition (Gotoda et al., "Detection of Three Separate DNA Polymorphisms in 
!□ the Human Lipoprotein Lipase Gene by Gene Amplification and Restriction 

;2 Endonuclease Digestion," J. Lipid Res. , 33(7):1067-72 (1992)), Athma et al. used 

!y PCR extension of a 3' terminal mismatch primer to create a restriction site for 

S discriminating between two alleles (Athma et al., "Single Base Polymorphism 

1 5 Linked to the Ataxia-Telangiectasia Locus is Detected by Mismatch PCR," 
Biochem. Biophvs. Res. Commun. , 210(3):982-6 (1995)). A G-T mismatch 
;7§ produced an Mval site (CC A/T GG) through an A-»G transition. Successful 

|^ A^G conversions have been performed in accordance with the method of the 

□ present invention, using a natural base mismatch, but difficulties with T^C 

20 conversion by natural base primers has been encountered. Transitions can be 

accomplished more readily than transversions, but the yield of correct product can 
be sequence dependent. Others have also found that PCR-RFLP can produce 
false-positive results (Hodanova et al., "Incorrect Assignment of N370S Mutation 
Status by Mismatched PCR/RFLP Method in Two Gaucher Patients," J. Inherit. 
25 Metab. Pis. , 20(4):61 1-2 (1997)). The use herein of ligase detection reaction 
allows the determination of the precise amounts of misextension products 
generated. 

The fidelity of polymerase extension from primers containing 3' 
natural bases and nucleotide analogs has been measured. Results indicate that 
30 natural base mismatch primer extension cannot be used as a general technique to 
create restriction sites in any given sequence for RFLP analysis. Primer slippage 
appears to be an important mechanism for producing error in mismatch primer 
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extension. This source of error may have a dramatic impact on some allele- 
specific PCR and other methods of high sensitivity mutation detection. With 
further development and testing of nucleotide analogs to facilitate conversion, 
mismatch primer extension may become a technique that can efficiently introduce 
5 desired mutations with few artifacts. It has been found some nucleotide analogs 
improve mismatch primer extension (see Table 3, below). Further improvement 
of 3' mismatch extension will be required to minimize the high degree of context 
dependent error seen in transversions and lead to improved levels of sensitivity 
and broader scope of PCR/RFLP based mutation detection. 

10 

Table 3 
Conversion to 

Starting 



Base C T G A 

~~c c Q6 

T Q 6 T 

G G A or Q 7 

A Qs or Q 7 A 



Table 3. Summary of conversion strategy. A Q n convertide indicates preconversion is required 
1 5 using the indicated convertide prior to final conversion using natural base primers. In some cases, 
an additional convertide or using only the natural base will result in the desired conversion. 

Example 9 - Oligonucleotide Synthesis for PCR/RE/LDR 

20 

Oligonucleotides were synthesized at the 0.2 jumole scale by 
cyanoethyl phosphoramidite chemistry on an Applied Biosystems 394 DNA 
synthesizer. Standard 500A CPG columns and reagents (Applied Biosystems) 
were used with the following exceptions: oligonucleotides 50 bases in length 

25 were synthesized using wide-pore 1000A CPG columns (Applied Biosystems); 
oligonucleotides with fluorescent dye FAM at the 5' terminus were synthesized 
using FAM phosphoramidite (Applied Biosystems) with a 15 minute coupling 
step; oligonucleotides with 5' phosphate were synthesized using phosphorylation 
reagent (Glen Research) with a 15 minute coupling step. Oligonucleotides with 3' 

30 blocking group were synthesized using 3'-Spacer CPG columns (Glen Research). 



-58- 



Oligonucleotides with the 3' nucleotide analog 6-(2-deoxy-B-D-ribofuranosyl)- 
6H ? 8H-3,4-dihydropyrimido[4 ? 5-c][l ? 2]oxazine-7-one (Q 6 ) were synthesized 
using dP-CpG (Glen Research). 

5 Example 10 - PCR Polymerases and Buffers for PCR/RE/LDR 

The polymerases used were AmpliTaq (Applied Biosy stems), Vent 
and Vent(exo-) (New England Biolabs), and Expand polymerase mix (Tag and 
Pwol polymerase mixture, in Expand High Fidelity kit, Boehringer Mannheim). 

10 The commercially available PCR buffers used were supplied in the AmpliTaq and 
Expand kits. Tris pH 9.1 (pH values were measured using 1 M stock solutions at 
23°C), tricine pH 8.7, EPPS (N-[2-hydroxyethyl]piperazine-N'-3-propanesulfonic 
acid) pH 8.4, and citrate pH 7.6 (Sigma) were used for alternative PCR buffers. 
Unless otherwise noted, each 20 |^1 reaction contained 20 mM Tris, tricine, or 

15 citrate, 200 pg/ml bovine serum albumin, 2.5 mM MgC12, 200 jliM dNTP (each) 
and either 16 mM (NH 4 ) 2 S0 4 , or 50 mM potassium acetate. Formamide at 10% 
concentration was used as indicated (see Example 11, below). PCR buffers were 
made as 10% stocks requiring the addition of formamide as needed, dNTPs, and 
the oligonucleotide primers and template DNA. 

20 

Example 11 - Enzyme Buffer Notation 

Test PCR buffers are named to indicate the presence of one or 
more components: Tris/potassium acetate = buffer A; Tris/ammonium sulfate = 
25 buffer B; Tricine/ammonium sulfate = buffer D; EPPS/potassium sulfate = buffer 
E; EPPS/ammonium sulfate = buffer F; and Citrate/ammonium sulfate = buffer G. 
Component concentrations are described above. 

Example 12 - Amplification of p53 exon 7 from Genomic DNA 

30 

Part of p53 exon 7 surrounding codon 248 was amplified as shown 
in Figure 10. The upstream primer has nucleotide sequence corresponding to 

I 
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SEQ. ID. NO. 1, as follows: 

5 ' -GCCTC ATCTTGGGCCTGTGTT ATC-3 ' 

hybridized within the preceding intron, and the downstream primer, with a 
5 nucleotide sequence corresponding to SEQ. ID. NO. 2, as follows: 

5 ' -GTGG ATGGGT AGT AGT ATGG A AG AAATC-3 ' 

hybridized within exon 7. All PCR, restriction endonuclease digestion and 
10 ligation steps described throughout were performed using a GeneAmp PCR 

System 2400 (Perkin-Elmer). Several buffers and enzymes were used as indicated 
in Examples 10 and 1 1 . The p53 exon 7 amplification from genomic DNA was 
performed starting with a 20 jlaI reaction mixture containing 50 ng of DNA, 2.5 
mM of each dNTP, and 12.5 pmol of each primer in lx buffer without polymerase. 
1 5 The reaction mixture was covered with paraffin oil and preincubated for at least 
1 .5 min at 94°C in order to perform hot start by adding 1 \xl of polymerase diluted 
in lx buffer to introduce the required units of polymerase. The exon 7 segment 
was amplified for 40 cycles of 94°C 15 sec, 65°C 2 min, with an additional 5 min 
at 65°C at the end of the last cycle. PCR amplifications departing from this 
20 procedure were performed as indicated. 

Example 13 - PCR/RE/LDR: Fidelity Assay 

Templates were amplified with conversion primer pairs bracketing 
25 the central two basepairs of the Mspl site (CCGG) at codon 248 (Figure 1 IB). 

Tubes were prepared containing 10 fmol per reaction of either PCR amplified p53 
exon 7 or wild-type synthetic duplex template, PCR buffer, and primers. In 
parallel reactions, a synthetic 50 bp duplex marker template (MK), with the 
sequence CGGG replacing the Mspl site at codon 248, was added at 10°, 10"", 
30 10' 5 and 0 molar ratio to wild-type template. Reactions were preincubated for at 
least 1 .5 min at 94°C with all components present in CiNF buffer, buffer G(f), 
except Vent (exo-) polymerase. A "hot start" was performed by adding 1 \x\ of 
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polymerase at 94°C. When preconversion was performed, 2 cycles of 94°C 1 5 
sec, 55°C 1 min, 60°C 1 min were executed with 500 fmol each of the primers 
p53-248Q 6 and p53-248Q 6 R. Afterwards, 1 pmol of p53Taq248T and 
p53Taq248TR primers were added. When preconversion was not performed, the 
5 reactions contained 1 pmol each of the primers p53Taq248T and p53Taq248TR or 
the control primers p53Msp248C and p53Msp248CR. After reactions with and 
without preconversion were performed, conversion PCR was carried out as 
follows: 5 cycles of 94°C 15 sec, 55°C + l°/cyc 1 min (temperature ramp), 60°C 1 
min; then, 20 cycles of 94°C 15 sec, 60°C 2 min; then, a final 60°C 5 min 
1 0 extension. After 3 cycles of the temperature ramp 1 0 pmol of long zipcode 
3 conversion primers (p53zip248T and p53zip248TR, or p53zip248C and 

1 p53zip248CR) were added. After conversion, the wild-type DNA was digested 

: periodically during 20 cycles of "zipcode" PCR (described below). Polymerase 

3 was inactivated by freezing and thawing twice. Finally, LDR was performed to 

z 15 detect the conversion products without contribution from the original template 

(except in nonconversion control reactions). 

] Example 14 - PCR/RE/LDR: "Zipcode" PCR 

20 Wild-type sequences or wild-type conversion products were 

removed by restriction digestion. The appropriate restriction endonuclease was 
added to the reaction tube and supplemented with additional MgCb as required to 
allow efficient digestion. Mspl digestion was performed at 37°C for 15 min using 
no additional MgCl2 ? except when using citrate buffer. Taql digestion was 

25 performed at 65°C for 30 min at 6 mM Mg 2+ by adding 1 jal of enzyme diluted in 
140 mM MgCh. The undigested conversion products were reamplified from 1 \x\ 
of a 1 Ox dilution added to a 20 \x\ PCR reaction containing 10 pmol of the 
"zipcode" primers Ztop and Zbot (Figure 6B). These zipcode primers each 
contain a DNA sequence that is not similar in sequence to any genomic sequences 

30 present in the sample, thus only the products of previous PCR using primers 
containing the zipcode sequences will be efficiently amplified. Conversion 
products were amplified using Expand polymerase mix and buffer (see Example 
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10) After an initial RE digest, zipcode PCR reamplification followed by 
redigestion was performed as follows: reactions were preincubated at 94°C for at 
least 1.5 min then initiated with a hot start by adding 0.1 |ul of RE digested sample 
(1 jal of a lOx dilution) to a 20 jal reaction; 10 cycles of 94°C 15 sec, 65°C 2 min. 
5 Zipcode PCR amplification products were redigested as described above. 

Example 15 - PCR/RE/LDR: Ligase Detection Reaction 

Ligase detection reactions were performed in standard LDR buffer 

10 (25 mM Tris pH 7.6, 12 mM MgCl 2? 65 |ug/ml bovine serum albumin, 100 mM 

KC1 ? and 10 mM DTT). Each 20 \il reaction contained approximately 500 fmol of 
dsDNA (1 jlxI of PCR sample), 500 fmol of each discriminating probe, and 750 
fmol of common probe, shown in Figure 1 1C. Sets of discrimination and 
common probes were synthesized to detect the expected conversion products; i.e., 

1 5 converted to CNGG or TNGA at the Mspl position. The common probe was 

synthesized using 3'-Spacer C3 CPG columns and the 5' end was phosphorylated 
on the column using phosphorylation reagent. Discrimination probes of each set 
varied at the 3' terminal base to query the base in that location, i.e., the second 
base of the Mspl position. Discrimination probes had 5' tails of different length 

20 and a F AM label for fluorescence detection. The tail size identified the probe and 
allowed physical separation of different LDR products on an acrylamide gel. 

The LDR reaction was preincubated for 1 .5 min at 94°C prior to 
the addition of 5 nmol Tth ligase enzyme under a layer of mineral oil. 10 LDR 
cycles of 94°C 1 5 sec, 65°C 2 min. were used. The reactions were then held at 

25 94°C until cold quenched on ice and stored at -70°C. The LDR products were 
separated on 10% acrylamide gels containing 7M urea with 0.6x TBE (lx TBE 
contains 90 mM Tris base, 90 mM Borate, 2 mM EDTA) used in the gel and for 
the running buffer. Data were collected using an ABI 373 automated DNA 
sequencer and Applied Biosystems Genescan 672 software (GS Collection and GS 

30 Analysis). 
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Example 16 - Image Processing for PCR/RE/LDR 

Raw gel pictures were produced by the ABI GS Analysis software. 
Dye specific pictures were opened in Adobe Photoshop 3.0, cropped, resized, and 
5 converted to grayscale. The grayscale images were opened in NIH Image 1.59, 
inverted and ID vertical background was subtracted. Optionally, NIH Image 
could render a 3-D plot from a corrected 2-D picture. Background corrected 
pictures were reinverted and rendered in pseudocolor by Photoshop by replacing 
the color table to make subtle intensity differences easier to compare. Except for 
10 color replacement, only linear image processing was performed in order to 
™ preserve relative intensities. 

m Example 17 -PCR/RE/LDR Optimization 

15 In Figure 10, PCR/RE/LDR was developed to detect and identify 

low abundance mutations occurring within the Mspl site (CCGG) at codon 248 in 
H; the p53 gene. An initial PCR amplifies exon 7 from genomic DNA, shown in 

I s * Figure 10A. This product serves as the template for a second PCR that amplifies 

i4 the central CpG dinucleotide in the Mspl site. To generate a restriction site in 

20 sequence lacking a preexisting site, mismatch primers are used to alter one or 

more bases flanking the CpG dinucleotide. This results in a conversion PCR that 
creates a restriction site (NCGN — > TCGA) Taql site, for example, as in Figure 
10B. In a generalized method for introducing contiguous Type II restriction sites, 
conversion PCR primers by necessity have 3' terminal mismatches. To avoid 
25 unfavorable natural base mismatches that may result in insertion of an erroneous 
base at the next site (O'Dell et aL, Genome Res. , 6(6):558-68 (1996); and Eiken et 
aL, "Application of Natural and Amplification Created Restriction Sites for the 
Diagnosis of PKU Mutations," Nucleic Acids Res. . 19(7): 1427-30 (1991)), 
preconversion with 3 5 nucleotide analog primers is performed. However, 
30 extension with 3' analog primers produces a pool of degenerate products (Day et 
aL, Nucleic Acids Res. , (1999)). Thus, after this preconversion step, natural base 
primers are used to selectively amplify the desired products. 



-63 - 



Mismatch conversion error relative to PCR error was assessed by 
performing parallel nonconversion control reactions and true conversion reactions 
with and without preconversion. Nonconversion reaction products retained the 
Mspl site (CCGG) (see Figure 10A), while conversion introduces a Taql site 
5 (TCGA) (see Figure 10B). All PCR/RE/LDR steps were performed under similar 
conditions, varying only the primers and restriction endonuclease (Mspl or Taql). 
In both cases, noncleavable DNA is preferentially amplified. When wild-type 
DNA is selectively removed by digestion, it is necessary to determine the 
proportion of DNA with incorrect sequence produced relative to the initial 

10 quantity of DNA in the sample, which is nearly 100% wild-type. Parallel 

reactions were performed in which known fractions of marker (MK) DNA were 
present. The MK DNA contained a single base change in the Mspl site (CGGG), 
rendering it uncleavable by Mspl. C^G transversions are unlikely to occur 
through polymerase error. The MK standard curve allows quantification of 

1 5 mutations detected by LDR. Figure 1 0C shows the LDR quantification of mutant 
sequencing using a marker. PCR conditions were tested to minimize PCR error 
(observed in the nonconversion reactions) and mismatch extension errors 
(additional errors observed in the conversion reactions). 

Various proofreading and nonproofreading polymerases were 

20 tested, as different polymerase properties are required during target amplification 
from genomic DNA, conversion, and reamplification steps in PCR/RE/LDR. 
Since it is essential throughout PCR/RE/LDR to minimize any alteration of the 
bases assayed by LDR, proofreading polymerases might seem the logical choice 
for maintaining the highest fidelity (Keohavong et al., PCR Meth. Appl. , 2:288-92 

25 (1993)); however, they may interfere with conversion by mismatch primer 

extension. Hence, PCR conditions must be found which maximize the fidelity of 
nonproofreading polymerases (Keohavong et al., Proc. Natl. Acad. Sci. USA , 
86)23):9253-7(1989)). 

Initially, PCR/RE/LDR was used as a high sensitivity assay to 

30 determine PCR conditions that maintain the highest fidelity throughout the 
procedure. Two main sources of error were expected: 1) polymerase 
misincorporation, and 2) DNA template degradation. Raising the PCR buffer pH 
improves long PCR, probably by decreasing depurination which leads to strand 
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cleavage (Barnes, "PCR Amplification of up to 35-Kb DNA with High Fidelity 
and High Yield from Lambda Bacteriophage Templates," Proc. Natl. Acad. Sci. 
USA , 91(6):2216-20 (1994); Cheng et al., "Effective Amplification of Long 
Targets from Cloned Inserts and Human Genomic DNA," Proc. Natl. Acad. Sci. 
5 USA , 91(12):5695-9 (1994); and Sang et al., "Generation of Site-Directed 
Mutagenesis by Extralong, High-Fidelity Polymerase Chain Reaction," Anal. 
Biochem. , 233(l):142-4 (1996)). While higher pH may decrease template 
damage, higher pH is also known to adversely affect polymerase fidelity (Eckert 
et al., "High Fidelity DNA Synthesis by the Thermus Aquaticus DNA 
10 Polymerase," Nucleic Acids Res. , 18(13):3739-44 (1990); Eckert et ah, "DNA 
,^ Polymerase Fidelity and the Polymerase Chain Reaction," PCR Methods App L 

43 1(1): 17-24 (1991); and Cline et al., "PCR Fidelity of Pfu DNA Polymerase and 

in 

m Other Thermostable DNA Polymerases," Nucleic Acids Res. , 24(1 8):3546-5 1 

2: (1996)). Therefore, other buffers were also tested: tricine, EPPS, and citrate 

^ 1 5 buffers, which have pK a s in the range of 7 to 8 and |ApK a | lower than Tris. Tris 

cannot meet the dual constraints of a slightly alkaline pH at high temperature to 
ri maintain template integrity and neutral pH at the extension temperature to 

M- maintain polymerase fidelity, although most PCR fidelity and long PCR studies 

□ use Tris. Some investigators have explored the use of alternative buffers with 

13 20 lower |ApK a | (Eckert et al., "High Fidelity DNA Synthesis by the Thermus 

Aquaticus DNA Polymerase," Nucleic Acids Res. . 18(13):3739-44 (1990); Eckert 
et al., "DNA Polymerase Fidelity and the Polymerase Chain Reaction," PCR 
Methods AppL , 1(1): 17-24 (1991); and Brail et al., "Improved Polymerase 
Fidelity in PCR-SSCPA," Mutat. Res. , 303(4):171-5 (1993)). Buffer-specific 
25 effects on PCR were tested for purposes of optimizing the buffer conditions of the 
present invention by making PCR buffers containing identical components except 
for the buffering compound. Making one set of test PCR buffers with ammonium 
sulfate and another with potassium acetate tested salt effects. The ApK a of each 
buffer was determined in pure solution and in Ix PCR buffer mixtures (data not 
30 shown). Results of these tests agreed with published ApK a values of pure buffers 
(Good, "Amine Buffers Useful for Biological Research," in Fasman, ed., 
Handbook of Biochemistry and Molecular Biology , Cleveland, Ohio: CRC Press, 
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Inc, pp. 367-369 (1976); and Blanchard, "Buffers for Enzymes/' Meth. Enzymol. , 
104:404-14 (1984)) corrected by a small constant (.005 pH units/°C), possibly due 
to a temperature-dependence of the pH probe itself. The pH of the each test PCR 
buffer was adjusted to produce approximately neutral pH at 65°C. However, the 
5 lx PCR buffers had somewhat different ApK a s compared to the pure buffers; for 
example, lx TsN, buffer B, had ApK a = -.033/°C versus -.030/°C for 100 mM Tris, 
and lx TcK, buffer C, had ApK a = -.022/°C versus -.025/°C for 100 mM tricine. 

Test PCR buffers containing Tris, tricine, or EPPS were used to 
test PCR fidelity with no conversion of the Mspl site (CCGG) at codon 248 of p53 

10 (Figure 12). The objective in this experiment was to test the error rate of PCR 
using various buffers and polymerase enzymes. Since introduced errors create 
template that cannot be cleaved by the selected restriction enzyme, false-positives 
accumulate as this error template continues to amplify alongside true mutant 
DNA. This established the conditions necessary to achieve amplification while 

1 5 minimizing error. The same polymerase and buffer set was used in both 

preamplification of p53 exon 7 from genomic DNA and in the "conversion" step. 
As mentioned, the "conversion" step maintains the Mspl site by using perfectly 
matched primers whose 3 ' ends terminate on the C and G bases flanking the 
central CpG. After an initial Mspl digest, template and amplification products 

20 were periodically redigested every ten cycles during reamplification to remove 
WT sequence. Synthetic marker mutant MK with the sequence CGGG was 
present in these reactions at 10* , 10" 9 or 10" and 0 ratio to wild-type (WT). MK 
will not be cleaved by Mspl restriction digestion, but will amplify with each PCR 
cycle to provide an internal control to measure product quantities (see below). 

25 The MK product will also maintain its sequence, as the perfect match primers in 
the conversion step will again terminate on the C and G bases flanking the central 
GG. Error products resulting from MK PCR will in general lack Mspl sites and 
will be indistinguishable from regular MK template. If an Mspl site is 
accidentally created, the product will be destroyed by digestion. If false LDR 

30 error products are also generate, these can be detected by comparison with a "no 
marker" control. 
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For each buffer, LDR detected MK products in each of the four 
parallel reactions, with the 0 MK control indicating the background level of 
CGGG error produced. The intensities of other error products detected by LDR 
were compared to MK to estimate the fraction of each error product generated. 
5 AmpliTaq generated few transversions (C^G or C— >A), but a large amount of 
C— >T transition was observed, as shown in Figure 12 A. Vent generated much less 
of the C— >T transition compared to AmpliTaq (Figure 12B). AmpliTaq showed 
little dependence on the presence of potassium acetate in buffers A, C, and E 
(Figure 12A lanes 1-4, 9-12, 17-20) versus ammonium sulfate in buffers B, D, and 

10 F (Figure 12A lanes 5-8, 13-16, 21-24). Vent polymerase amplified template 
more efficiently in Tris/ammonium sulfate buffer B than Tris/potassium acetate 
buffer A (Figure 12B lanes 1-4 versus lanes 5-8), as described previously (Barnes, 
"PCR Amplification of up to 35-Kb DNA with High Fidelity and High Yield from 
Lambda Bacteriophage Templates," Proc. Natl. Acad. Sci. USA , 91(6):2216-20 

15 (1994); Keohavong et ah, PCR Meth. AppL . 2:288-92 (1993); Cariello et al., 

"Fidelity of Thermococcus Litoralis DNA Polymerase (Vent) in PCR Determined 
by Denaturing Gradient Gel Electrophoresis," Nucleic Acids Res. , 19(1 5):4193-8 
(1991); and Mattila et al., "Fidelity of DNA Synthesis by the Thermococcus 
Litoralis DNA Polymerase~An Extremely Heat Stable Enzyme with Proofreading 

20 Activity," Nucleic Acids Res. , 19(18):4967-73 (1991)). However, Vent exhibited 
improved fidelity in tricine/potassium acetate buffer C (Figure 12B lanes 9-12) 
and EPPS buffer E (Figure 12B lanes 17-20) compared to tricine/ammonium 
sulfate buffer D (Figure 12B lanes 13-16) and EPPS/ammonium sulfate buffer F 
(Figure 12B lanes 21-24). 

25 The relative fidelities of the different polymerase-buffer 

combinations may be described by their "sensitivity" expressed as the -logio of 
the ratio of MK to WT initially present. The C^T error for AmpliTaq 
amplification in Tris/potassium acetate buffer A can be taken as an example. If 
the signal for the CTGG error product (Figure 12A lane 2) is compared to the MK 

30 CGGG signal (Figure 12A lanes 1-3), the intensity of the signal most resembles 
the 10' 3 MK: WT dilution (Figure 12A lane 1). Thus, the C->T error rate is 10' 3 ; 
the sensitivity is 3, since -log[MK/WT] = -log[10~ J ] = 3. From this it can be seen 
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that the higher the sensitivity, the lower the error rate. Reactions with higher 
sensitivities for each mutation had the highest overall fidelity (Results 
summarized in Table 1). Many of the Vent reactions had sensitivities of 5 (i.e., 1 
in 10 5) for every mutation (Figure 12B), while the AmpliTaq reactions had 
5 sensitivities of 3 (i.e, 1 in 10 3)) ' as shown in Figure 12A. Sensitivity indicates the 
usefulness of the assay rather than the error rate of the polymerase. Error (ER) per 
base per cycle may be estimated from the fraction (F) of all mutations occurring at 
one base which accumulated over 65 cycles (D) before digestion. For purposes of 
the present invention, the number of cycles is an estimate of the number of 
10 duplications, since multiple non-saturating PCRs were performed. From ER=F/D, 

n 

;«! Vent polymerase had an error rate of less than 1 x 10" /base/cycle in 

:~ tricine/potassium acetate buffer C, approximately 2 x 10" /base/cycle in 

\U tricine/ammonium sulfate buffer D, and 2 x 10" 6 /base/cycle in TsN, buffer B. An 

q error rate of 2 x 10" 5 /base/cycle was observed. This was due mainly to the C^T 

% 15 transition for AmpliTaq in tris/potassium acetate buffer A. Elimination of this 

artifact could improve AmpliTaq fidelity by more than 10-fold. Others have used 
y cloning and screening methods to estimate polymerase error (Eckert et al., "High 

^ Fidelity DNA Synthesis by the Thermus Aquaticus DNA Polymerase," Nucleic 

□ Acids Res. . 18(13):3739-44 (1990); Cline et aL, "PCR Fidelity of Pfu DNA 

: ™* 20 Polymerase and Other Thermostable DNA Polymerases," Nucleic Acids Res. . 

24(18):3546-51 (1996); Mattila et al., "Fidelity of DNA Synthesis by the 
Thermococcus Litoralis DNA Polymerase— An Extremely Heat Stable Enzyme 
with Proofreading Activity " Nucleic Acids Res. . 19(18):4967-73 (1991); and 
Huang et al., "Fidelity and Predominant Mutations Produced by Deep Vent Wild- 
25 Type and Exonuclease-Deficient DNA Polymerases During in Vitro DNA 

Amplification," DNA Cell Biol. , 15(7):589-94 (1996)), and denaturing gradient 
gel electrophoresis (DGGE) has also been used (Keohavong et al., PCRMeth. 
AppL , 2:288-92 (1993); Keohavong et al., Proc. Natl. Acad. Sci. USA , 
86)23):9253-7 (1989); Cariello et al., "Fidelity of Thermococcus Litoralis DNA 
30 Polymerase (Vent) in PCR Determined by Denaturing Gradient Gel 

Electrophoresis," Nucleic Acids Res. , 19(15):4193-8 (1991); and Ling et al., 
"Optimization of the Polymerase Chain Reaction with Regard to Fidelity: 
Modified T7, Taq, and Vent DNA Polymerases," PCR Methods Appl. , l(l):63-9 
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(1991)). However, these methods do not directly measure mutated DNA, and do 
not detect all mutations. By cloning and DGGE methods, Vent polymerase has an 
error rate estimated from 0.3 to 4 xlO" 5 /base/cycle (Cariello et al., "Fidelity of 
Thermococcus Litoralis DNA Polymerase (Vent) in PCR Determined by 
5 Denaturing Gradient Gel Electrophoresis," Nucleic Acids Res. , 1 9(1 5):41 93-8 
(1991); and Ling et al., "Optimization of the Polymerase Chain Reaction with 
Regard to Fidelity: Modified T7, Taq, and Vent DNA Polymerases," PCR 
Methods Appl. , l(l):63-9 (1991)). The error rate of Taq polymerase has been 
estimated from 0.8 to 9 xlO" 5 /base/cycle (Eckert et al., "High Fidelity DNA 

1 0 Synthesis by the Thermus Aquaticus DNA Polymerase," Nucleic Acids Res. , 

18(13):3739-44 (1990); Cline et al., "PCR Fidelity of Pfu DNA Polymerase and 
Other Thermostable DNA Polymerases," Nucleic Acids Res. , 24(18):3546-51 
(1996); and Brail et al., "Improved Polymerase Fidelity in PCR-SSCPA," Mutat. 
Res. , 303(4):171-5 (1993)), comparable to the error rate we observed for 

15 Ampli7a<7 in TsK, buffer A. Of the thermostable polymerases, Pfu has the lowest 
reported error rate estimated from 0.7 to 1 .6 XlO" 6 /base/cycle (Cline et al., "PCR 
Fidelity of Pfu DNA Polymerase and Other Thermostable DNA Polymerases," 
Nucleic Acids Res. , 24(18):3546-51 (1996); Lundberg et al., "High-Fidelity 
Amplification Using a Thermostable DNA Polymerase Isolated from Pyrococcus 

20 Furiosus," Gene , 108(1): 1-6 (1991); and Andre et al., "Fidelity and Mutational 
Spectrum of Pfu DNA Polymerase on a Human Mitochondrial DNA Sequence," 
Genome Res. , 7(8):843-52 (1997)). Pfu polymerase may also exhibit improved 
fidelity in tricine or other low |ApK a | buffers. 

While high fidelity proofreading enzymes appeared to improve 

25 amplification from genomic DNA, proofreading still must be avoided in the 

conversion step. Different high fidelity genomic amplification conditions were 
tested in combination with fixed conversion conditions. Genomic amplification 
was performed with either Vent(exo-) Citrate/ammonium sulfate, buffer G, or 
Vent(exo-) Citrate/ammonium sulfate, buffer G, with 1 0% formamide buffer G(f) 

30 (see Table 4). Nonconversion primers were used with Vent(exo-) to optimize 
PCR fidelity in anticipation of conversion by mismatch primer extension. The 
highest fidelity conditions were as follows: Genomic amplifications with Vent/ 
Buffer G were initiated by spiking genomic amplification product from Expand/ 
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Buffer C with 10% formamide reactions after 3 cycles. These Vent/ Buffer G 
reactions required 4 mM Mg 2+ and PCR primers, but no additional genomic DNA 
was provided. See Table 4 for observed error rate with other conditions tested. 
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Table 4 

Polymerase enzymes Buffer Limiting error Error rate 

1 ) Genomic DN A PCR 1 )-*2) total per cycle 



2) conversion 



Taq — > Taq 


A->A 


C->T 


10 J 


2x10"^ 


cc 


B— >B 


a 


10" 3 


2xl0" 5 


ct 


\_ — 




10" 3 


2xl0" 5 


tc 






10" 3 


2xl0" 5 


it 






10" 3 


2xl0" 5 


tt 


r — t r 




10" 3 


2x 1 0" 5 


y — ^ v eru 




O— »T 


>10" 3 


>2x 1 0" 5 


Vpnt v \/pnt 

V C11L 7 V Will 


B->B 


U 


10" 5 


2xl0" 7 


it 




it 


>10~ 5 


>2xl0" 7 


tt 


D->D 


14 


io- 4 


2xl0" 6 


a 


E^E 


Ct. 


>10" 5 


>2xl0" 7 




F— >F 


tt 


10" 5 


2x1 0~ 7 


Vent (exo-) —> 


C->G 


C->T 


IO" 4 


2xl0" 6 


Vent 










Vent (exo-) — > Vent (exo-) 


C->G 




IO" 3 


2xl0" 5 


Vent — > Vent (exo-) 


C->G 




IO* 4 


2xl0" 6 


Vent — > Vent (exo-) 


C->G(f) 




IO" 5 


2xl0" 7 


Vent 0 -> Vent (exo-) 


G(4)-»G 


ct 


IO" 4 


2xl0" 6 


Vent — > Vent (exo-) 


G(8)->G(f) 


< c 


10" 5 


2x 1 0 -7 


Vent c -> Vent (exo-) 


G(4)^G(0 




>10 -5 


>2x 1 0" 7 



a Based on minimum of 50 total cycles, i.e. observed error +50. 

b No vent PCR product from genomic DNA. Used Taq amplified product for Vent conversion PCR. 
c Template added by taking 1 \x\ after 3rd PCR cycle from a parallel genomic DNA amplification using 



5 Expand polymerase mix in Buffer C. 



Table 4. Comparison of fidelity using proofreading and nonproofreading polymerases in different buffers for 
PCR to amplify the target sequence from genomic DNA and for conversion PCR. Taq and Vent polymerases 
were initially tested using one buffer for genomic amplification and conversion. During the conversion step, 

10 only nonconversion of the Msp\ site near p53 codon 248 was performed using short perfect match primers 

(Figure 1 1 A) to determine the background level of polymerase error. LDR quantified Msp\ site mutations at 
the second position (CCGG -> CNGG). Fidelity was compared in parallel reactions using proofreading and 
nonproofreading polymerases in genomic amplification and conversion. Expand polymerase mix, Taq with 
proofreading Pfu polymerase added, was used to initiate target amplification from genomic DNA for 

1 5 subsequent high fidelity Vent polymerase PCR. Vent polymerase was substituted with nonproofreading 
Vent(exo-) to determine whether proofreading was required and also in the conversion step where 
proofreading is not permitted. The effect of 10% formamide in the conversion PCR buffer was also tested. 
All buffers contained 200 [ig/ml bovine serum albumin, 2.5 mM MgCl 2 , and 200 dNTP (each). Specific 
components were: A (TsK), 20 mM Tris pH 9.1, 50 mM potassium acetate (standard Taq polymerase buffer). 

20 B (TsN): 20 mM Tris pH 9.1, 16 mM ammonium sulfate (standard Vent polymerase buffer). C (TcK): 20 
mM tricine pH 8.7, 50 mM potassium acetate. D (TcN): 20 mM tricine pH 8.7, 16 mM ammonium sulfate. 
E (EpK): 20 mM EPPS pH 8.4, 50 mM potassium acetate. F (EpN): 20 mM EPPS pH 8.4, 16 mM 
ammonium sulfate. G (CiN): 20 mM citrate pH 7.6, 16 mM ammonium sulfate, (f) indicates presence of 
10% formamide. (4) indicates increase to 4 mM MgCl 2 - (8) indicates increase to 8 mM MgCl 2 . 

25 

PCR conditions were found for each step in PCR/RE/LDR that 
maintain high fidelity when no mismatch conversion was performed. Using 
known high fidelity PCR conditions, conversion was also tested by changing the 



p53 codon 248 Mspl site (CCGG) into a Taql site (TCGA). MK (CGGG) was 
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added as before in parallel reactions to measure fidelity relative to the initial wild- 
type DNA present. High fidelity PCR was performed as described above and 
some (but not all) reactions were subjected to preconversion. Preconversion was 
performed using primers containing the degenerate pyrimidine nucleotide analog 
5 at their 3' ends, as shown in Figure 10B. The final conversion was 

accomplished using natural base 3' T mismatch primers. Products were detected 
using LDR to interrogate the second base position in the Mspl, Taql and MK 
sequence: CNGG or TNGA. Fidelity for conversion with and without 
preconversion was compared to a nonconversion control. Successful conversion 

10 will change the Mspl site (CCGG) into a Taql site (TCGA); MK will also be 

converted from CGGG to TGGA. However, the main issue of conversion success 
is the maintenance of the central bases in all cases: CpG for Taql conversions and 
GpG for MK. Figure 13 shows the results of conversion. In Figure 13, lanes 1-4 
(C:G) are nonconverted reactions that were digested with Mspl; lanes 5-8 (Qe'G) 

15 are preconverted/converted reactions that were digested with Taql; lanes 9-12 

(T:G) are converted reactions lacking preconversion that were digested with Taql. 
PCR/RE/LDR with no conversion was sensitive to better than 1 in 10 4 using the 
previously determined best conditions for preamplification and conversion shown 
in Figure 13, C:G lanes 1-4. PCR/RE/LDR with conversion of the Mspl site to a 

20 Taql site by T mismatch primers was apparently very successful at first glance, 
Figure 13, T:G lanes 9-12. As would be expected for successful conversion, no 
Mspl product can be detected in the _CG_ region of the figure; hence, it appears 
that the site was converted to Taql and then digested. However, although a very 
large fraction of MK (CGGG) is observed in the reactions with added MK, 

25 Figure 13, T:G lanes 9-11; the same large fraction is also observed in the 0 MK 
control lane in Figure 13, T:G lane 12. Thus, the entire quantity of MK is an 
artifact produced by mismatch extension of the natural base T primers. This event 
would convert the second position C in the Mspl site to a G during extension, 
mimicking the internal sequence of MK (CCGG^TGGA). Preconversion with 

30 Q6 primers eliminates the MK artifact, Qe:G lanes 5-8. 

The greater amount of WT present in nonconverted samples, 
shown in Figure 13 lanes 1-4, compared to Qe converted samples, lanes 5-8, may 
be due to inhibition of Mspl digestion by formamide. Formamide apparently 



-72- 



inhibits Mspl digestion as evidenced by the presence of strong wild-type LDR 
bands (WT) in the nonconversion control, C:G lanes, which are not present after 
Taql digestion of the converted sequence, Qe'.G and T:G lanes. 

The low amount of MK product seen in the Q 6 10 _4 and 10" 5 MK 
5 lanes (Figure 13, Qe:G lanes 6, 7) compared to the respective nonconversion 

control reactions, (Figure 13,C:G lanes 2, 3) may be due to low efficiency of MK 
conversion. The production of a Taql site actually requires two conversions, one 
on each side of the central CpG dinucleotide. Lowering the concentration of MK 
1 0-fold may reduce MK conversion far more than 1 0-fold. Thus, with only one 

10 side of the MK sequence converted in a large amount of its product, one half of 
the LDR probes will be unable to properly hybridize to this sequence and ligation 
will not occur. LDR detection will only reveal the lesser quantity of fully 
converted template. Nevertheless, the amount of MK product is greater than the 
control in these two lanes. Compare Figure 13, lane 8 to Figure 13, lanes 6 and 7. 

1 5 While formamide may reduce conversion efficiency, conversion fidelity is greatly 
improved. 

Newton et al., "Analysis of Any Point Mutation in DNA. The 
Amplification Refractory Mutation System (ARMS)," Nucleic Acids Res. , 
17(7):2503-16 (1989) found that C • T, A • A, and T • T mismatches were all far 

20 more difficult to extend with Taq polymerase than purine-pyrimidine mismatches. 
These results reflect PCR efficiency of extension rather than fidelity. Others have 
observed low fidelity in extending natural base mismatches (O'Dell et al., 
Genome Res. , 6(6):558-68 (1996); and Eiken et al., "Application of Natural and 
Amplification Created Restriction Sites for the Diagnosis of PKU Mutations," 

25 Nucleic Acids Res. , 19(7):1427-30 (1991)). Use of a nucleotide analog with 
structural similarities to multiple bases could potentially be used to allow 
polymerase extension (reading) from the analog when paired with different bases 
and insertion of different bases opposite the analog (writing). For the purposes of 
the present invention, the efficiency of the preconversion process need not 

30 necessarily be high. However, successful conversion requires high PCR fidelity 
to ensure that only the bases targeted for conversion are altered. False-positive 
mutation artifacts will result from alterations of bases not targeted for conversion 
within the sequence probed for mutations. Preconversion using 3' Q6 primers 
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forming a Qe-G mismatch avoids starting polymerase extension with a G • T 
mismatch. In subsequent amplification cycles, A is apparently written frequently 
opposite Q 6 . This observation is consistent with the results of Hill et al. in which 
Q 6 base-paired like C and T with nearly equal frequency (Hill et al., "Polymerase 
5 Recognition of Synthetic Oligodeoxyribonucleotides Incorporating Degenerate 
Pyrimidine and Purine Bases," Proc. Natl. Acad. Sci. USA , 95(8):4258-63 
(1998)). Facile tautomerization allows Q 6 to mimic either pyrimidine when base- 
paired and avoids mismatch wobble. When the natural base primer is added after 
preconversion, a significant quantity of perfect match template already exists, 
10 otherwise the MK artifact would appear in the reaction regardless of 

preconversion. Other nucleotide analogs in addition to Q 6 may serve as a bridge 
for more efficient conversions (Day et al., Nucleic Acids Res. , (1999)). 

The fidelity of polymerase extension from primers in PCR has been 
measured, and conditions were found, which in some cases improve fidelity. 
1 5 Presumably, higher fidelity resulted from a decrease in polymerase 

misincorporation, primer slippage, and template degradation. PCR/RE/LDR 
allows the measurement of very low-level "mutant" sequences by preferentially 
amplifying non-wild-type sequences. The method of the present invention clearly 
demonstrates that natural base mismatch primer extension cannot be used as a 
20 general technique to create restriction sites at will in any sequence for RFLP 

analysis. As shown in Figure 13 and observed previously (O'Dell et al., Genome 
Res. , 6(6):558-68 (1996); and Eiken et al., "Application of Natural and 
Amplification Created Restriction Sites for the Diagnosis of PKU Mutations," 
Nucleic Acids Res. , 19(7): 1427-30 (1991)), natural base mismatch extension is 
25 prone to error. To perfectly engineer a restriction site from existing sequence, an 
error-free approach is required. The results of these examples, using the method 
of the present invention, indicates that the use of nucleotide analogs combined 
with high fidelity PCR conditions may radically decrease errors. Monitoring the 
true specificity of primer extension was possible in these studies because LDR can 
30 measure specific PCR errors accurately and with high sensitivity. Thus, the 

products of different polymerases and buffers could be assayed at different steps 
during PCR/RE/LDR to maximize both PCR efficiency and fidelity. As a result, a 
PCR/RE/LDR strategy could be assembled to achieve the goal of 10 5 sensitivity. 
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However, this highest sensitivity was achieved only in the special case of no 
conversion at a preexisting Mspl site. At this time, primer slippage remains an 
important mechanism through which mismatch primer extension errors can arise 
(Day et al., Nucleic Acids Res. , (1999)). Although the importance of this source 
5 of error in vivo is uncertain, it may have a dramatic impact on allele-specific PCR 
and other in vitro methods of mutation detection. An additional source of error 
arises from using natural base primers to select specific sequences for 
amplification following preconversion with nucleotide analogs. This is because a 
fraction of the selective natural base primers may form a mismatched pair with 

10 bases other than the intended base. It is known that a characteristic set of different 
bases insert opposite nucleotide analogs (Day et al., Nucleic Acids Res. , (1999); 
and Hill et al., "Polymerase Recognition of Synthetic Oligodeoxyribonucleotides 
Incorporating Degenerate Pyrimidine and Purine Bases," Proc. Natl. Acad. Sci. 
USA , 95(8):4258-63 (1998)). Thus, a high fidelity mismatch primer extension 

1 5 protocol awaits the development of new convertides that can overcome these 
problems. In combination with high fidelity PCR and LDR monitoring of 
efficiency, mismatch primer extension may become a technique for the precise 
introduction of desired mutations without artifacts. 

Although the invention has been described in detail for the purpose 

20 of illustration, it is understood that such details are solely for that purpose. The 
variations can be made therein by those skilled in the art without departing from 
the spirit of the scope of the invention which is defined by the following claims. 



